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ABSTRACT 

Insulin is used medically to manage diabetes mellitus by frequent subcutaneous injections have turned into a part of life can be enormously 
distressing for patients. Hence, we anticipate to getting rid of the usage of subcutaneous injections completely. Virtual screening method (for 
selection of carriers), Toxtree tool (for toxicity evaluation of carriers) and Discovery Studio tool (for Pharmacopore designing, ADMET analysis, 
designing oral Insulin conjugates and Interaction studies between Insulin Receptor and oral Insulin conjugates) were used for proposed study. 
We have screened 14 competent drugs delivering agents (DDAs) from 7 chemical compound databases. The ADMET and Pharmacophoric 
properties of DDAs were analyzed by drug-informatics’ tools. Consequently, the DDAs were mono, di & poly conjugated by covalent bonding 
with various binding sites of Monomeric and hexameric form of human insulin and insulin-lispro (Humalog®) individually; and novel oral-insulin 
conjugates (OICs) were generated. Its binding efficiency and biological activity with Insulin-receptor were determined. Inulin and Vitamin-B1 
are considered as novel, safe and proficient carriers for oral delivery of Insulin. Insulin Lispro is the remarkable option for oral delivery than 
other Insulin forms. 
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Abbreviations: DDAs — Drug delivering agents; OICs — Oral Insulin Conjugates; Da - Dalton; FDA — Food & Drug Administration; IN-105 - 
Methoxy-poly(ethylene glycol)-insulin conjugates; HIM2 - Hexyl-insulin monoconjugate-2; KEGG - Kyoto Encyclopedia of Genes and Genomes; 
ChEBI - Chemical Entities of Biological Interest; log(Sw) - log Aqueous solubility (Solubility in water); logBB - log Brain-Blood; CYP2D6 - 
cytochrome P450 2D6; HIA - Human intestinal absorption; PSA - polar surface area; TOPKAT - Toxicity Prediction by Komputer Assisted 
Technology; BFGS - Broyden-Fletcher-Goldfarb-Shanno; BBB- Blood brain barrier; ADMET - Absorption, Distribution, Metabolism, Excretion and 
Toxicity; SASA - Solvent Accessible Solvent Area; MTD - Maximum tolerated dose; C1 - Vitamin B12 (cobalamin); C2 - Vitamin H (Biotin); C3 - 
Folic acid (Vitamin M / Vitamin B9); C4 - Vitamin B1 (Thiamin); C5 - L-Carnitine (Vitamin BT); C6 - Poly-N-vinylpyrrolidone; C7; Inulin; C8 - Poly 
Cysteine; C9 — Chitosan; C10 — Pectin; C11 - Poly(propylene glycol); C12 - Poly(propylene imine); C13 - Poly (lactic-co-glycolic acid); C14 - 
Deoxycholic acid. 


1. INTRODUCTION 


Drug delivery is an essential process which leads to innovation of novel carriers to manage oral delivery of some peculiar drugs such as 
proteins, hormones, enzymes, and peptides. These Insulin drugs are requires efficient drug delivering agents (DDAs) for oral formulations to 
reach therapeutic targets against the interruption of biological barriers such as gastric digestion in stomach, proteolysis in intestinal regions, 
and poor bioavailability due to higher molecular weight.2>° Now a day, the subcutaneous injection of insulin formulations has been the core 
therapy utilized for the management of type-1-diabetes as well as type-2-diabetes since insulin’s discovery over 85years ago.2 Multiple doses 
of Insulin subcutaneous injections on the daily basis making patients painful and create ‘fear of pain’ among them.‘ There are several 
disadvantages of this subcutaneous administration such as poor compliance, local discomfort, inconvenience, fright of soreness of multiple 
injections, hypoglycemic risk related to injections, occasional hyper-insulinemia due to overdoses, unsatisfactory metabolic regulations, allergy, 
harrowing, and insulin lipodystrophy at the site of injection.24 In addition, subcutaneously administered insulin is absorbed directly into the 
peripheral circulation without initial hepatic circulation, thereby exposing peripheral targets to higher insulin concentrations relative to the 
liver.4 As of 2014, no products appeared to be successful in the market because of lack of physical stability against biological barriers.4~11 For 
those effective reasons, we intend to screen novel DDAs to design efficient oral Insulin conjugates (OlCs) against diabetes. 
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In our analysis, the monomeric & hexameric form of recombinant human insulin and Humulin were modified by the covalent bonding of 
short-chain DDAs. The modification takes place to the either or both amino-acids of PheB1 & LysB29 in recombinant human insulin, and also 
PheB1 & LysB28 in Humulin. Because, the conjugation of DDAs to Insulin forms ought to have no negative impact on the insulin’s therapeutic 
activity since it has been formerly determined that these residues do not directly contribute in receptor binding.’?-4 On these perspectives, a 
study proven that DDA-recombinant Insulin conjugate (at LysB29) was physically more stable against acidic condition, not undergoes enzymatic 
degradation, less immunogenic, less antigenic, resistance to fibrillation in aqueous solution and improved stability against temperature, pH, 
and interfacial & shear forces’>*®8 when correlated with DDA-recombinant Insulin conjugate (at PheB1), and human native-Insulin.2919 
Moreover, the conjugation of recombinant Insulin with DDA can protect the self-aggregation (dimerization) in aqueous solutions.®° In this 
stance, Humulin has higher resistant capability against self-aggregation & other biological barriers than human recombinant-Insulin and native- 
Insulin.2° The oligomeric-Insulin forms (chiefly hexamers) is not bioactive and the fraction of an amount is absorbed across the capillary 
endothelium into the systemic circulation in the absence of DDA. The dissociation of oligomer into dimmers and monomers is seen as the rate- 
limiting barrier to absorption that effectively affects the preparation’s pharmacological response.** Meantime, the monomeric-Insulin forms of 
is highly bio-available but easily de-nature & de-folded in presence of biological barriers.4> In a study, the in-vivo pharmacodynamic assay 
reveals that there is no loss of biological activity after conjugation of carrier to the either site on the oligomeric form of insulin-B-chain.2? On 
the other hand, the attachment of long-chain DDA (2000Da) decreased the bioactivity of conjugates than sort-chain DDA (750Da). Hence, 
mono-disperse and short-chain DDAs (<750Da) are preferred.® From the earlier studies, we choose that Humulin, monomeric & hexameric of 
human recombinant insulin and short chain DDAs for designing OICs. We report, and analyze the structural series of novel DDAs and OICs by 
drug-informatics. 


2. Methods 


2.1. Carriers 

2.1.1. Screening 

For Virtual Screening?? of DDAs, PubChem Compound,?3 Zinc Database,24 KEGG25 ((Kyoto Encyclopedia of Genes and Genomes)), DrugBank,?° 
ChemSpider,2” ChEMBL,28 and ChEBI29 (Chemical Entities of Biological Interest), compound databases were used. Carrier agents were retrieved 
by the search terms of “Polymer” & “Biopolymer”. The compounds were filtered through virtual screening & biomedical text mining using a 
defined criteria listed below, in part akin to the Lipinski’s rule;3° the DDAs should be : a) mono-disperse, b) short-chain (low-molecular weight 
(<2000Da)), c) biocompatible, d) lipophilic, e) physically stability against gastric acids and proteolytic enzymes, f) inert (no biological activity), g) 
non-toxic. 


2.1.2. Analysis of Pharmacophoric features 

The retrieved compounds were subjected to Pharmacophore analysis by Discovery Studio (Accelrys discovery studio 2.5).31 In Discovery Studio, 
a pharmacophore is defined as the essential features or chemical substructures and their corresponding 3D locations that are responsible for 
the similar biological activities of a set of compounds. Typically, pharmacophore features include hydrophobic (in light blue), hydrogen bond 
acceptor (HBA, in green), hydrogen bond donor (HBD, in Magenta), and active principles. 


2.1.3. Analysis of Physico-chemical properties 
The retrieved compounds were subjected to ADMET evaluation by Discovery Studio. ADMET Descriptors include: 

e Aqueous Solubility: This model uses linear regression to predict the solubility of each compound in water at 25°C. Key to aqueous 
solubility is graded through Level, Value and Drug-likeness as follows: (level 0; log(Sw) < -8.0; Extremely low), (level 1; -8.0 < log(Sw) < - 
6.0; No, very low, but possible) , (level 2; -6.0 < log(Sw) < -4.0; Yes, low), (level 3; -4.0 < log(Sw) < -2.0; Yes, good), (level 4; -2.0 < 
log(Sw) 0.0=""; Yes, optimal); (level 5; 0.0 < log(Sw); No, too soluble). 

e Blood Brain Barrier Penetration: This model predicts blood-brain penetration (blood brain barrier, BBB) after oral administration. This 
model contains a quantitative linear regression model for the prediction of blood-brain penetration, as well as 95% and 99% confidence 
ellipses in the ADMET_PSA_2D, ADMET_AlogP98 plane (ADMET_PSA_2D means Fast polar surface area; ADMET_AlogP98 means Atom- 
based LogP). There are four prediction levels within the 95% and 99% confidence ellipsoids and they are graded through Level, Value 
and Brain-Blood ratio as follows: (Level 0; Very high penetrants (logBB >= 0.7); Brain-Blood ratio greater than 5:1), (Level 1; High 
penetrants (0 < logBB < 0.7); Brain-Blood ratio between 1:1 and 5:1), (Level 2; Medium penetrants (-0.52 < logBB < 0); Brain-Blood ratio 
between 0.3:1 and 1:1), (Level 3; Low penetrants (logBB < -0.52); Brain-Blood ratio less than 0.3:1), (Level 4; Undefined; Outside 99% 
confidence ellipse). 

e CYP2D6 Binding: Predicts cytochrome P450 2D6 enzyme inhibition. The cytochrome P450 2D6 model predicts CYP2D6 enzyme 
inhibition using 2D chemical structure as input. The model classifies compounds as either O or 1 for non-inhibitor or inhibitor and 
provides an average-class-value estimate of confidence. Key to CYP2D6 is graded through Predicted class, value and Description as 
follows; (Predicted class 0; Non-inhibitor; Unlikely to inhibit CYP2D6 enzyme; ADMET_CYP2D6_Probability < 0.5), (Predicted class 1; 
Inhibitor; Likely to inhibit CYP2D6 enzyme; ADMET_CYP2D6_Probability > 0.5). ADMET_CYP2D6_Probability means CYP2D6 score or 
average class value. 

e Hepatotoxicity: Predicts the occurrence of dose-dependent human hepatotoxicity. The hepatotoxicity model predicts potential organ 
toxicity for a wide range of structurally diverse compounds. Key to Hepatotoxicity is graded through Predicted class, value and 
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Description as follows; (Predicted class 0; Nontoxic; Unlikely to cause dose-dependent liver injuries. ADMET_Hepatotoxicity_ Probability 
< 0.5), (Predicted class 1; Toxic; Likely to cause dose-dependent liver injuries. ADMET_Hepatotoxicity_Probability > 0.5). 
ADMET_Hepatotoxicity_Probability means Hepatotoxicity score (average-class value). 

e Intestinal Absorption: This model predicts human intestinal absorption (HIA) after oral administration. Intestinal absorption is defined 
as a percentage absorbed rather than as a ratio of concentrations (cf. blood-brain penetration). A well-absorbed compound is one that 
is absorbed at-least 90% into the bloodstream in humans. The intestinal absorption model includes 95% and 99% confidence ellipses in 
the ADMET_PSA_2D, ADMET_AlogP98 plane. The ellipses define regions where well-absorbed compounds are expected to be found: 
95% of well-absorbed compounds are expected to fall within the 95% ellipse, while 99% of well-absorbed compounds should fall within 
the 99% ellipse. Note that the location of any particular compound does not necessarily imply whether it will be well, moderately or 
poorly absorbed. In general, however, absorption tends to drop off quite rapidly outside the 95% ellipse. These levels are defined by 
the 95% (blue line) and 99% (magenta line) confidence ellipsoids. There are four prediction levels and they are graded through level, 
value and Description as follows: (Level 0; ADMET_Absorption_T2_2D < 6.1261 (inside 95%); Good absorption), (Level 1; 6.1261 < 
ADMET_Absorption_T2_2D < 9.6026 (inside 99%); Moderate absorption), (Level 2; 9.6026 < ADMET_Absorption_T2_2D (outside 99%); 
Low absorption), (Level 3; ADMET_PSA_2D > 150.0 or ADMET_AlogP98 < -2.0 or ADMET_AlogP98 > 7.0; Very low absorption). 
ADMET_Absorption_T2_2D is the Mahalanobis distance for the compound in the ADMET_PSA_2D, ADMET_AlogP98 plane. It is 
referenced from the center of the region of chemical space defined by well-absorbed compounds. 

e Plasma Protein Binding: The plasma protein binding model predicts whether a compound is likely to be highly bound to carrier proteins 
in the blood. Key to Plasma Protein Binding is graded through level and Description as follows; (Level 0; Binding is < 90% (No markers 
flagged and AlogP98 < 4.0)), (Level 1; Binding is > 90% (flagged at 90% or AlogP98 > 4.0)), (Level 2; Binding is > 95% (flagged at 95% or 
AlogP98 > 5.0)). AlogP98 means Atom-based LogP from FastDesc. 


2.1.4. Analysis of Toxicity 

The retrieved compounds were subjected to Toxicity evaluation by Discovery Studio and TOXTREE (by IdeaConsult Ltd (Sofia, Bulgaria)).22 
Toxtree is able to estimate toxic hazard by applying a decision tree approach. The classification result is shown in graphical form (green 
highlight for class | (non-toxic), yellow highlight for class Il (Moderately toxic) and red highlight for class III (Toxic)), as well as in text form. In 
Discovery Studio; TOPKAT models have been re-trained using updated training sets from the legacy TOPKAT (Toxicity Prediction by Komputer 
Assisted Technology). The following models are extensible and are derived using calculable properties; 

e FDA Rodent Carcinogenicity 

e Ames Mutagenicity 

e Rat Oral LD50 

e Rat Maximum Tolerated Dose 

e = Skin Irritancy 

e Skin Sensitization 

e Aerobic Biodegradability 


2.2. Designing Oral insulin conjugates & OIC — IR binding 
For designing, Oral insulin conjugates & Interaction of OIC with IR were carried out through “LibDock” algorithm of Discovery Studio.?3 The 
LibDock docking program performs the following steps using a set of pre-generated ligand conformations and a receptor with a specified 
binding site: 
e Remove hydrogen atoms. 
e — Rank ligand conformations and prune by Solvent Accessible Solvent Area (SASA). 
e Find hotspots using a grid placed into the binding site and using polar and apolar probes. The numbers of hotspots are pruned by 
clustering to a user defined value. 
e Dock ligand poses by aligning to the hotspots. This is performed by using triplets (i.e., three ligand atoms are aligned to three 
receptor hotspots). Poses which result in protein clashes are removed. 
e A final Broyden-Fletcher-Goldfarb-Shanno (BFGS) pose optimization stage is performed using a simple pair-wise score (similar to 
Piecewise Linear Potential). The top scoring ligand poses are retained. 
e Hydrogen atoms are added. 
Hydrogen atoms added in the final step may result in small bumps with the protein. Therefore, minimization should be performed prior to 
using scoring functions that are sensitive to such bumps. 


3. RESULTS & DISCUSSION 


3.1. Carriers 

3.1.1. Screening 

In carrier screening (Flowchart 1), more than 1, 00,000 compounds were retrieved from 7 compound databases. Among those, 14 compounds 
were screened by using experimental text mining & filtration criteria (Table 1). According to data mining, most of the screened compounds in 
Table 1 are monodisperse such as Vitamin B12,34 Vitamin H,*° Folic acid,°* Poly-N-vinylpyrrolidone,®”® Inulin,”® Poly Cysteine,®® Chitosan,9*%% 
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Pectin,1°> Poly (Propylene glycol),*!2113 Poly (Propylene imine),!2®129 Poly (lactic-co-glycolic acid),133-34 Deoxycholic acid‘“1 except Vitamin B1 
and L-Carnitine, because of the lack of experimental data. Molecular weight of polymeric drug delivering molecules are varies based on length 
of chain, but in the case of Vitamins, molecular weights are measurable. Maximum carriers in the retrieved list have shown low-molecular 
weight due to short-chain in structure (<2000Daltons). L-Carnitine (162.113 Daltons), Poly-N-vinylpyrrolidone (11.141 Daltons), Inulin (342.297 
Daltons), Poly Cysteine (121.158 Daltons), Pectin (194.139 Daltons) and Poly(propylene glycol) (76.094 Daltons) are possess low-molecular 
weight while compare with vitamins and other macromolecules in the list. All the listed molecules are biocompatible and biodegradable.?® 
48,53,57,64,69,77,90,97, 106, 114,130,135,142 

Most of the carriers are having efficient oral bioavailability and intestinal permeability such as Vitamin B12,2°39 Vitamin H,4°-48 Folic 
acid,>45> Vitamin B1,5% L-Carnitine,o1© Inulin,’® Poly(propylene glycol),115116 Poly(propylene imine),3+ Poly (lactic-co-glycolic acid)*8139 and 
Deoxycholic acid.144145 Chitosan,99 100102104 and Pectin! are graded as moderately efficient in bioavailability while Poly-N-vinylpyrrolidone’®7+ 
and Poly Cysteine”? are very poor intestinal transport, because of the lack of lipophilicity. Biomedical text mining shows all the listed 
compounds are physically stable against gastric acids and proteolytic enzymes during drug delivery.40485458,65,73,81,94,99,111,117,130,144,145 Among the 
retrieved carriers, Vitamin B12,42 Vitamin H,*° Vitamin B1,°° Poly-N-vinylpyrrolidone,”* Inulin,8283 Chitosan,°° Poly(propylene glycol),12? Poly 
(lactic-co-glycolic acid)*?”? and Deoxycholic acid*4® are chemically inert and they does not undergoes any biochemical transformation & 
aggregation during drug delivery. Through text-mining, we could not found whether Folic acid, L-Carnitine, Poly Cysteine, Pectin, and 
Poly(propylene imine) are inert or not. Based on the concept of toxicity, Vitamin B12,44 Vitamin H,°° Folic acid,>© Vitamin B1,© L-Carnitine,©* 
Inulin,84 Poly Cysteine,2° Chitosan,°? Pectin,?°? Poly (lactic-co-glycolic acid)!4° are non-toxic materials and does not produce any untoward 
reactivity during drug-delivery mechanism. But, Poly-N-vinylpyrrolidone,’> Poly(propylene glycol),12© Poly(propylene imine),132 and Deoxycholic 
acid'4° are moderately toxic based on text-mining. 

The experimental text-mining concludes Inulin, Chitosan and Poly (lactic-co-glycolic acid) are efficient, safe and primary drug delivering 
molecules of drugs that completely fulfill the filtration criteria. Inulin reduces the production of potentially toxic metabolites, induce important 
immune-mediated effects and reduce the cancer risk during drug delivery.8*®” Generally Inulin’? and Chitosan991™ are participating in colon- 
targeting drug delivery, possess very minimum exposure to gastric fluids in the stomach and enzymatic degradation in the small intestine. Folic 
acid, L-Carnitine and Pectin are eligible for drug delivery process, but we could not conclude whether they are inert. Vitamin B1 is an effectual 
and non-toxic carrier, may cause Anaphylaxis at higher doses.®° L-Carnitine & Vitamin B1 does not hold any literature evidence that they 
possess mono-disperse character or not. Vitamin B12 is inert, long-term usage may cause Vit-B12 deficiency.** Vitamin H is inert, but it may 
inert the biological activity of the drug.*9 Poly-N-vinylpyrrolidone,”> and Poly (propylene imine)**? are participating in nanoparticle-based drug 
delivery, and both are moderately toxic. Poly Cysteine is hydrophilic in nature; shows poor permeability across intestinal epithelium;2*%? and no 
research confirmation whether it is inert. Poly(propylene glycol) is an efficient carrier for drug delivery, and inert, overdose may cause skin & 
eye irritation.148122 It may induce Cardiotoxic effects include arrhythmias and cardiac arrest, CNS depression, Renal and hepatic damage has 
also reported. 123 In case studies, toxic symptoms appeared only after frequent doses of propylene glycol, used as a vehicle for medicines, were 
repetitively applied to the skin.124125 In drug delivering mechanism, it eliminates toxic degradation.’° It is less toxic than the parent substance 
(Poly ethylene glycol).!2” Deoxycholic acid is a prominent carrier, which is not inert and moderately toxic; may promote colon tumorigenesis in 
both animals and humans.16 


3.1.2. Analysis of Pharmacophoric features 

Pharmacophores are conceptual description of molecular principles or features that are essential for molecular recognition of a molecule 
through a biological macromolecule. Pharmacophoric features of Drug delivering molecules are illustrated and demonstrated by Discovery 
Studio software (Table 2; Figure 1). Pharmacophore features comprise hydrogen bond donor, hydrogen bond acceptors, hydrophobic centroids, 
aromatic rings, cations, and anions. Among those features, Acceptor, Donor, hydrophobic regions and the number of active principles of carrier 
molecules (Supplementary Figure 1) were investigated based on Lipinski’s rule.7° According to the rule of Lipinski, a molecule should not donate 
more than 5 hydrogen bonds and it should not accept more than 10 hydrogen bonds. Most of the carriers obey the rule and eligible for drug 
delivery except Inulin and Vitamin B12. Inulin posses 11 hydrogen bond acceptors (< 10 as per rule) may leads to accepting electrons 
transferred to it from another compound. Inulin may be an oxidizing molecule, by virtue of its accepting electrons, it itself reduced in the drug 
delivery process. Inulin may undergo permanent chemical alteration through covalent or ionic reaction chemistry, resulting in the complete 
and irreversible transfer of one or more electrons. But in this case, Inulin is a colon targeting molecule; subsequently it does not undergo any 
chemical transformation during drug delivery mechanism7?*! like Chitosan.9°1% Vitamin B12 posses 7 hydrogen bond donors (<5 as per rule) 
may leads to donate electrons to another compound. Vitamin B12 may be a reducing agent, by virtue of its donating electrons, it itself oxidized 
in the drug delivery process. Compare with another Drug delivering molecules, Inulin, Vitamin B12 and Deoxycholic acid contains maximum 
hydrophobic regions (Table 2) that is responsible for efficient intestinal transport.3®3978.144,145 The significant criteria, pharmacophoric active 
principle is more in Folic acid, Vitamin B1, Inulin, Chitosan and Pectin, But Vitamin B12 and Poly(propylene imine) does not hold any single 
active principle in their structure. The analysis of pharmacophoric features suggests that Vitamin B1, Inulin, Chitosan, and Pectin are possible 
Drug delivering molecules for Insulin. 


3.1.3. Analysis of Physico-chemical properties 

ADME descriptors of Drug delivering molecules were evaluated by Discovery Studio and listed in Table 3 (Supplementary Figure 2). Vitamin H, 
Vitamin B1, Poly Cysteine, Poly(propylene glycol), Poly (lactic-co-glycolic acid) and Deoxycholic acid are falls within 95% absorption ellipse that 
shows efficient absorption (level = 0) across intestinal epithelium, concurrently shows the sign of maximum bioavailability in drug delivering 
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mechanism. Poly-N-vinylpyrrolidone and Poly(propylene imine) are falls within 99% absorption ellipse that shows moderate absorption (level = 
1); remaining compounds are falls outside 99% absorption ellipse shows poor absorption (level = 3). In the case of Aqueous solubility, Folic acid 
(-3.378) show extremely high Aqueous Solubility and drug-likeness. Vitamin H (-1.432), Vitamin B1 (-1.335), Poly-N-vinylpyrrolidone (-0.550) 
and Poly(propylene imine) (-0.097) shows optimal Aqueous Solubility and drug-likeness; others show poor drug-likeness. In the point of Blood- 
Brain-barrier (BBB) penetration, Deoxycholic acid (-0.154) is medium penetrant across Blood-Brain-barrier, because it is fall within 99% 
confidence ellipsoids (level = 2), the Brain-Blood ratio is between 0.3:1 and 1:1. Vitamin H (-1.229), Vitamin B1 (-1.253), Poly Cysteine (-1.362), 
Poly(propylene glycol) (-1.039), Poly (lactic-co-glycolic acid) (-1.713) are low penetrants across Blood-Brain-barrier, because it is within 99% 
confidence ellipsoids (level = 3), the Brain-Blood ratio is less than 0.3:1. Other carriers are poor penetrants because they are outside the 95% 
and 99% confidence ellipsoids (undefined level = 4). In the case of CYP-4502D6 binding, the carrier should be non-inhibitor because CYP- 
4502D6 is responsible for metabolism and elimination of drug molecules. All the listed drug delivering molecules are non-inhibitors because 
their ADMET CYP2D6 Probability is < 0.5. In the case of Hepatotoxicity, most of the selected carriers are non-toxic because, their ADMET 
hepatotoxicity probability is < 0.5. But Vitamin B12 (0.509) and Folic acid (0.662) are toxic because, their ADMET hepatotoxicity probability is > 
0.5. Plasma Protein Binding capability should be <90%, then only the unbound molecule can easily penetrate the tissues to reach the active site 
and then to get eliminate. The Plasma Protein Binding character of all listed carriers is low and satisfies the standard value (< 4.0). Based on the 
overall results of Physico-chemical properties, Vitamin H, Vitamin B1, and Deoxycholic acid are superior drug delivering molecules according to 
their efficient solubility in liquid, moderate penetration across Blood Brain barrier, enhanced absorption across intestinal epithelial cells, non- 
inhibition of CYP-4502D6 binding, unbound nature with plasma proteins and low hepatotoxicity. 


3.1.4. Analysis of Toxicity 

Toxic scale is the most significant prediction for carriers in drug delivery mechanism. Toxicity properties such as FDA Rodent Carcinogenicity, 
Mutagenicity, Rat oral LD50 (g/Kg Body weight), Rat maximum tolerated dose (g/Kg Body weight), Skin Irritant, Skin sensitization, Aerobic 
biodegradability and general toxicity were studied by Discovery Studio and Toxtree (Table 4 & Supplementary Figure 3 & 4). The listed carriers 
are Non-mutagen and are Non-carcinogen except Vitamin B1 and Poly-N-vinylpyrrolidone. The predicted skin irritancy is severe for Deoxycholic 
acid, while Vitamin H, Folic acid, Vitamin B1, Poly Cysteine, Pectin, and Poly (lactic-co-glycolic acid) are non-irritants and others are in the 
category of mild-irritant. Vitamin B1 is predicted under strong skin-sensitizer, whereas Folic acid and Deoxycholic acid are weak skin- 
sensitizers; others fall under non-skin-sensitizer. The drug delivering molecules should be biodegradable in an aerobic environment after 
delivering the drug to receptors; otherwise it may leads to untoward reactivity. In this case, Folic acid, Vitamin B1 and Poly(propylene imine) 
are Non-Degradable, others are Degradable under aerobic condition. The TOPKAT model predicts the rat oral acute median lethal dose (LD50) 
in the toxicity test, and the rat maximum tolerated dose (MTD) of all drug delivering molecules. According to general toxicity prediction by 
Toxtree tool, Vitamin B12, Folic acid, Poly-N-vinylpyrrolidone, Chitosan, Poly(propylene glycol), Poly(propylene imine), and Deoxycholic acid 
may highly toxic while others are low in toxic category. Based on the analysis of toxicity studies, we concluded that Vitamin H, Inulin, Poly 
Cysteine, Pectin, and Poly (lactic-co-glycolic acid) are safe carriers for delivering oral insulin molecule. 


3.2. Oral insulin conjugates 

3.2.1. Designing 

Human Insulin Monomer (PDB ID: 1HLS), human insulin hexamer (PDB ID: 1AIO), and Insulin Lispro (PDB ID: 1 LPH) were retrieved from Protein 
data bank and conjugated with all carriers individually. Nikhil J Kavimandan et al. (2006)°9 and Hinds et al. (2000)® suggest that, conjugation of 
carriers with B1Phe, B27Thr, B28Pro & B29Lys amino acids of human Insulin and B28Lys amino acid of Insulin Lispro will be the efficient 
conjugate against ADMET barriers in oral delivery. Based on our computational analysis (Discovery Studio - LibDock), the positive LibDock score 
indicating the competent oral-insulin conjugation. 

Human Insulin Monomer (PDB ID: 1HLS), conjugated individually with all listed drug delivering molecules (Table 5 & Supplementary Figure 
5); among those Inulin was mono-conjugated efficiently with B1Phe of Insulin Monomer, and Poly (lactic-co-glycolic acid) was di-conjugated 
competently with B1Phe & A11Cys of Insulin Monomer. Vitamin B12, Vitamin M, poly-N-vinylpyrrolidone, poly(propylene imine), and 
Deoxycholic acid did not shows any conjugation with amino acids of Insulin Monomer. Rest of the drug delivering molecules form the 
incompetent mono & di-conjugates with A & B chain amino acids of Insulin Monomer. Based on the Binding site (PHE B1) and LibDock score, 
Inulin (117.663) shows the competent oral-insulin conjugation (Figure 2c). Based on the LibDock score, Vitamin H (86.2835) and Vitamin B1 
(94.1144) shows the competent oral-insulin conjugation (Figure 2a & 2b); but in the case of Poly (lactic-co-glycolic acid), the score is poor; even 
it conjugate at PHE B1 (Figure 2d). 

Human insulin hexamer (PDB ID: 1AIO) conjugated individually with all listed drug delivering molecules (Table 6 & Supplementary Figure 6); 
among those Vitamin B1 and Inulin were mono-conjugated efficiently with B29Lys of Insulin hexamer and form the competent mono 
conjugates. Vitamin B12, poly-N-vinylpyrrolidone, and poly(propylene imine) did not shows any conjugation with amino acids of Insulin 
hexamer. Rest of the drug delivering molecules forms the incompetent mono & di-conjugates with A, B, C, D, E, F, G, H, I, J, K and L chain amino 
acids of Insulin hexamer. According to LibDock score, Vitamin M (114.324), Vitamin H (103.231) and Inulin (94.3543) show the competent oral- 
insulin conjugation (Figure 3a, 3b & 3c). Based on the Binding site (LYS B29), Inulin and Vitamin B1 shows the competent oral-insulin 
conjugation (Figure 3c & 3d). 

Insulin Lispro (PDB ID: 1 LPH) conjugated individually with all listed drug delivering molecules (Table 7 & Supplementary Figure 7); among 
those Vitamin M was poly conjugated with A & B chain amino acids of Insulin Lispro such as A1Gly, A2lle, A3Val, A4Glu, A19Tyr, B4GIn, B5His, 
B27Thr, B28Lys, B30Thr. This conjugates efficiently bonding with B27Thr and B28Lys amino acids. It also made the inefficient multiple bonding 
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interactions with Insulin Lispro and may affect the absorption and bioavailability of Insulin Lispro. Vitamin B1 was di-conjugated reasonably 
with Lys B28 & Glu A4 of Insulin Lispro. Inulin was tetra-conjugated fairly with B28Lys, A1Gly, A3Val, and A4Glu of Insulin Lispro. Vitamin B12, 
Vitamin H, poly-N-vinylpyrrolidone, poly(propylene imine), and Deoxycholic acid did not shows any conjugation with amino acids of Insulin 
Lispro. Rest of the drug delivering molecules forms the incompetent mono, di, tetra & poly-conjugates with A & B chain amino acids of Insulin 
Lispro. According to LibDock score and Binding site (LYS B28), Vitamin M (131.57), Vitamin B1 (89.8971) and Inulin (76.2195) shows the 
competent oral-insulin conjugation (Figure 4a, 4b & 4c). 


3.3. Interaction of Oral insulin conjugates with Insulin Receptor (IR) 

Among the designed conjugates, Inulin-Insulin Monomer conjugate, Vitamin B1-Insulin Monomer conjugate, Vitamin H-Insulin Monomer 
conjugate, Vitamin M-Insulin hexamer conjugate, Vitamin H-Insulin hexamer conjugate, Inulin-Insulin hexamer conjugate, Vitamin M- Insulin 
Lispro conjugate, Vitamin B1 -Insulin Lispro conjugate, and Inulin- Insulin Lispro conjugate are selected as capable oral-insulin conjugates to 
interact with Insulin Receptor. In the case of Insulin Receptor, the leucine-rich repeat domain (L1, residues 1-157) and C-terminus of the a- 
chain (aCT, residues 704-715) are Insulin binding surface?4715° and they functions as a signaling element to activate its tyrosine kinase and 
predicted to influence Insulin receptor—Oral Insulin conjugate interaction. In the proposed work, none of oral Insulin conjugates shows 
interaction with aCT while Insulin Monomer - DDM Conjugates (Figure 5a) does not show interaction with L1 domain (Table 8 & Supplementary 
Figure 8). Insulin Hexamer - DDM Conjugates interacts with ARG86 and ASN34 residues of IR (Figure 5b), while Insulin Lispro -DDM Conjugates 
interacts with ARG86, ASN90 and ARG114 residues of IR (Figure 5c) which reflects the efficient binding affinity of Insulin Lispro - DDM 
Conjugates with IR. 

In our analysis, fourteen drug delivering agents were screened and its characteristic features for oral delivery of Insulin were examined. 
Based on the toxicity and conjugation ability with various forms of Insulin, the drug delivering molecules were chosen for developmental 
studies. From the overall results we nominate Vitamin B1 and Inulin are suitable drug delivering agents because: 1) Vitamin B1 completely 
satisfy the defined criteria based on bio-text mining; it shows better pharmacophoric and efficient ADME features; 58.59 it is carcinogen but low 
toxic® as per drug informatics analysis; it shows efficient conjugation with Insulin monomer, Insulin hexamer and Insulin Lispro. Vitamin B1- 
insulin hexamer and Vitamin B1- Insulin Lispro conjugates shows efficient interaction with IR. Vitamin B1 is a highly recommended molecule for 
in-vitro and in-vivo studies. 2) Inulin absolutely satisfy the defined criteria based on bio-text mining; it shows superior pharmacophoric and 
moderate ADME features;7678-81 jt is non-carcinogen, non-mutagen, low toxic®* as per drug informatics analysis. It shows efficient conjugation 
with Insulin monomer, Insulin hexamer, and Insulin Lispro. Inulin-insulin hexamer and Inulin - Insulin Lispro conjugates shows efficient 
interaction with IR. Inulin is a highly recommended molecule for in-vitro and in-vivo studies. 

Moreover, we choose Poly (propylene glycol) is a possible drug delivering agent because it is partially fulfills the defined criteria based on 
bio-text mining; it shows moderate pharmacophoric and moderate ADME features;115116122 jt is non-carcinogen, non-mutagen, moderately 
toxic!2® as per drug informatics analysis. It shows reasonable conjugation with insulin monomer, insulin hexamer, and Insulin Lispro. In earlier 
studies, propylene glycol showed toxic symptoms after the frequent doses & repeated application when used as a vehicle in medicinal 
preparations.12*125 Meantime, in drug delivering mechanisms, it eliminates the toxic degradation,'2° and less toxic than the parent substance 
(Polyethylene glycol),12” because PEG is a frequently used drug delivering molecule for oral insulin delivery2®. Consequently, Poly (propylene 
glycol) may be a possible carrier for further studies. 

In another stand, while compare the binding efficiency of various Insulin forms; Insulin Lispro (LysB28) shows the competent conjugation 
with drug delivering molecules and the resultant conjugates shows therapeutically capable interaction with IR than Insulin Monomeric and 
hexameric form of conjugates. 


4. CONCLUSION 


The oral bioavailability of insulin is 1%. It may be enhanced by novel carriers that deliver insulin to the site of absorption. In the proposed work, 
based on the defined criteria 14 drug delivering molecules were filtered from 7 reputed compound databases and it’s Pharmacophoric and 
ADMET properties were analyzed by Biomedical text mining and drug-informatic tools. The results from the conducted studies concluded that 
Inulin and Vitamin B1 are considered as novel, safe and proficient oral carriers for Insulin. (Polyethylene glycol) is an optional vehicle for oral 
delivery of Insulin. Insulin Lispro is the tremendous option for oral delivery than other Insulin forms. Clinical studies are recommended to 
develop our results. 
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Flowchart 1: Screening process of Drug delivering molecules 
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Figure 1 

Pharmacophoric features of Drug delivering molecules are illustrated by Discovery Studio software. Acceptors (Green in color), Donors 
(Magenta in color) and Hydrophobic Regions (Blue in color) of Pharmacophoric features of all Drug delivering molecules are demonstrated and 
differentiated by color. C1- Vitamin B12 (cobalamin); C2- Vitamin H (Biotin); C3 - Folic acid (Vitamin M / Vitamin B9); C4 - Vitamin B1 (Thiamin); 
C5 - L-Carnitine (Vitamin BT); C6 - Poly-N-vinylpyrrolidone; C7; Inulin; C8 - Poly Cysteine; C9 — Chitosan; C10 — Pectin; C11 - Poly(propylene 
glycol); C12 - Poly(propylene imine); C13 - Poly (lactic-co-glycolic acid); C14 - Deoxycholic acid 
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Figure 2a 
Bioconjugate from Human Insulin Monomer (PDB ID: 1HLS) and Vitamin H is illustrated by Discovery Studio software. Vitamin H conjugated at 


CYS A11, GLN B4 and HIS B10 aminoacids of Insulin Monomer. The inner figure illustrates the CYS 11 amino acid of Insulin-A chain conjugates 
with Vitamin H. 


Figure 2b 
Bioconjugate from Human Insulin Monomer (PDB ID: 1HLS) and Vitamin B1 is illustrated by Discovery Studio software. Vitamin B1 conjugated at 


CYS A6, CYS A11 and LEU B6 aminoacids of Insulin Monomer. The inner figure illustrates the CYS 6, CYS 11 amino acids of Insulin-A chain and 
LEU 6 amino acid of Insulin-B chain conjugates with Vitamin B1. 
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Figure 2c 
Bioconjugate from Human Insulin Monomer (PDB ID: 1HLS) and Inulin is illustrated by Discovery Studio software. Inulin conjugated at PHE B1 
aminoacid of Insulin Monomer. The inner figure illustrates the PHE 1 amino acid of Insulin-B chain conjugates with Inulin. 


Figure 2d 

Bioconjugate from Human Insulin Monomer (PDB ID: 1HLS) and Poly (lactic-co-glycolic acid) is illustrated by Discovery Studio software. Poly 
(lactic-co-glycolic acid) conjugated at PHE B1 and CYS A11 aminoacid of Insulin Monomer. The inner figure illustrates the PHE 1 amino acid of 
Insulin-B chain conjugates and with Poly (lactic-co-glycolic acid). 
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Figure 3a 

Bioconjugate from Human insulin hexamer (PDB ID: 1AIO) and Vitamin M is illustrated by Discovery Studio software. Vitamin M conjugated at 
GLY A1 and THR B27 aminoacids of Insulin hexamer. The inner figure illustrates the GLY 1 amino acid of Insulin-A chain conjugates with Vitamin 
M. 


Figure 3b 

Bioconjugate from Human insulin hexamer (PDB ID: 1AIO) and Vitamin H is illustrated by Discovery Studio software. Vitamin H conjugated at 
GLN B4 and ARG B22 aminoacids of Insulin hexamer. The inner figure illustrates the ARG 22 amino acid of Insulin-B chain conjugates with 
Vitamin H. 
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Figure 3c 
Bioconjugate from Human insulin hexamer (PDB ID: 1AIO) and Inulin is illustrated by Discovery Studio software. Inulin conjugated at LYS B29 


aminoacid of Insulin hexamer. The inner figure illustrates the LYS 29 amino acid of Insulin-B chain conjugates with Inulin. 


Figure 3d 
Bioconjugate from Human insulin hexamer (PDB ID: 1AIO) and Vitamin B1 is illustrated by Discovery Studio software. Vitamin B1 conjugated at 
LYS B29 aminoacid of Insulin hexamer. The inner figure illustrates the LYS 29 amino acid of Insulin-B chain conjugates with Vitamin B1. 
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Figure 4a 
Bioconjugate from Insulin Lispro (PDB ID: 1 LPH) and Vitamin M is illustrated by Discovery Studio software. Vitamin M conjugated at GLY A1, ILE 


A2, VAL A3, GLU A4, TYR A19, GLN B4, HIS B5, THR B27, LYS B28 and THR B30 aminoacids of Insulin Lispro. The inner figure illustrates the LYS 28 
amino acid of Insulin-B chain conjugates with Vitamin M. 


Figure 4b 
Bioconjugate from Insulin Lispro (PDB ID: 1 LPH) and Vitamin B1 is illustrated by Discovery Studio software. Vitamin B1 conjugated at GLU A4 


and LYS B28 aminoacids of Insulin Lispro. The inner figure illustrates the LYS 28 amino acid of Insulin-B chain conjugates with Vitamin B1. 
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Figure 4c 
Bioconjugate from Insulin Lispro (PDB ID: 1 LPH) and Inulin is illustrated by Discovery Studio software. Inulin conjugated at GLY A1, VAL A3, GLU 
A4 and LYS B28 aminoacids of Insulin Lispro. The inner figure illustrates the LYS 28 amino acid of Insulin-B chain conjugates with Inulin. 


Figure 5a 

Interaction results of Oral insulin conjugates (Insulin Monomer (1HLS)- DDM Conjugates) with Insulin Receptor (IR). It does not show any 
interaction in leucine-rich repeat domain (L1, residues 1-157) and in C-terminus of the a-chain (aCT, residues 704-715). The inner figure shows 
the interaction in LYS310 aminoacid of IR which is not responsible for initiation of therapeutic effect. 
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Figure 5b 

Interaction results of Oral insulin conjugates (Insulin Hexamer (1AIO)- DDM Conjugates)- DDM Conjugates) with Insulin Receptor (IR). It shows 
the interaction in ARG86, ASN34 of leucine-rich repeat domain (L1, residues 1-157) and no interaction in C-terminus of the a-chain (aCT, 
residues 704-715). The inner figure shows the interaction in ASN34 aminoacid of IR which is responsible for initiation of therapeutic effect. 


Figure 5c 

Interaction results of Oral insulin conjugates (Insulin Lispro (1LPH) - DDM Conjugates) with Insulin Receptor (IR). It shows the interaction in 
ARG86, ASN90 and ARG114 of leucine-rich repeat domain (L1, residues 1-157) and no interaction in C-terminus of the a-chain (aCT, residues 
704-715). The inner figure shows the interaction in ARG86, ARG114 aminoacids of IR which is responsible for initiation of therapeutic effect. 
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Table 1 Physco chemical details of DDMs by using experimental text mining & filtration criteria 


Filtration Criteria by experimental text mining 
Is it Is it inert 
Is it Mono- Is it physically while 
Database & disperse Short-chain Is it lipophilic? stability participate in 
s Carrier Identification while (low- Is it (Bioavailability against Drug ; 
No Number participate molecular Biocompatible & ; Efficient gastric delivering mn ae 
in Drug weight Biodegradable? intestinal acids and Conjugate eel 
delivering (<2000Da))? transport) proteolyti (no 
Conjugate? (in Daltons) c biological 
enzymes? activity)? 
C1 Vitamin B12 CHEBI Mono- 1,357 35 Biocompatible & | Oral Stable?738, Non-toxic & 
(cobalamin) 1D:30411 disperse*4 Biodegradable** bioavailability | 4°42 Inert*? Anti 
3b is high3639 mutagenic’? 
44 
C2 Vitamin H CHEBI Mono- 244 Biocompatible & | Efficient Stable*®48 Inert’? Non -toxic®° 
(Biotin) 1D:15956 disperse*> Biodegradable** intestinal 
ie transport*48 
C3 Folic acid | CHEBI Mono- 441.40 Biocompatible & | Oral Stable5*>5 No literature | Non - 
(Vitamin M / | ID:27470 disperse Biodegradable®*> | bioavailability toxic®+5° 
Vitamin B9) 3 is high5*> 
C4 Vitamin B1 CHEMBL ID: | No 265.112 Biocompatible & | Oral Stable*® Inert? Non-toxic® 
(Thiamin) 1547 literature Biodegradable*” bioavailability 
is high>® 
cS L-Carnitine CHEMBL ID: | No 162.113 Biocompatible & | Efficient Stable®t6 No literature | Non-toxic® 
(Vitamin BT) 1149 literature Biodegradable® intestinal 
Aas transport®6 
C6 Poly-N- Pubchem Mono- 11.141 Biocompatible & | Oral Stable’*-73 Inert”4 Moderately 
vinylpyrrolid Compound ID: | disperse®” Biodegradable® bioavailability toxic’> 
one 6917 68 is low7071 
C7 Inulin Zinc ID: | Mono- 342.297 Biocompatible & | Oral Stable’? | Inert®® Non - 
12358861 disperse’® Biodegradable” bioavailability toxic®*®” 
is high”® 
C8 Poly Cysteine | DrugBank ID: | Mono- 121.158 * Biocompatible & | Hydrophilic; Stable No literature | Non -toxic®> 
DB00151 disperse®® Biodegradable*®® | poor intestinal | in the 
. absorption presence 
92,93 of the 
peptidase 
a- 
chymotry 
psin, 
increase 
insulin 
stability 
against 
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degradati 
on 94 
c9 Chitosan ChemSpider Mono- 501.482 Biocompatible & | intestinal Stables Inert99102 Non - 
IDeesiee7s disperse®™ Biodegradable”? | permeability ave toxic??10 
28 7108 is Moderate®* 
104 
C10 | Pectin KEGG ID: | Mono- 194.139 Biocompatible & | intestinal Stable No literature | Non - 
c00714 disperse?> Biodegradable permeability (Colon toxic 
(Blood is Moderate? | specific 
Compatible)?0%109 delivery)" 
0,111 
C11 | Poly(propyle | CHEBI Mono- 76.094 Biocompatible & | Oral Inert!18-122 Moderately 
ne glycol) 1D:53262 disperse??1 Biodegradable’ | bioavailability | Stable ie 
a is hight#516 Less toxic 
(protectio than’ PEG"? 
kb ss 
n of 
encapsula 
ted 
substance 
s from 
degradati 
on)21” 
C12 | Poly(propyle | CHEBI ID: | Mono- Molecular Biocompatible & | Lipophilic?*+ Stable??° No literature | Moderately 
ne imine) 53266 disperse??? | weight varies | Biodegradable**° Toxict*? 
a based on 
length of 
chain 
C13 | Poly (lactic- | CHEBI:53493 Mono- Molecular Biocompatible & | Lipophilict?®° | Stable? Inert?3” Non - 
co-glycolic disperse’??? | weight varies | Biodegradable*> 439 toxic° 
acid) of based on | 17 
length of 
chain 
C14 | Deoxycholic DrugBank ID: | Mono- 392.57 Biocompatible & | Lipophilic’*#74° | Stable Not Moderately 
acid DB03619 disperse’ Biodegradable” A148 inert?44.445 Toxic*® 
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Table 2 


Pharmacophoric features of carriers 


Carrier Carrier Number of Acceptor | Number of Donors Number of Number of 
No in molecule in molecule hydrophobic | Pharmacophore 
Region principle 
C1 Vitamin B12 (cobalamin) | 8 7 29 Nil 
C2 Vitamin H (Biotin) 2 2 5 6 
C3 Folic acid (Vitamin M / | 4 4 7 10 
Vitamin B9) 
C4 Vitamin B1 (Thiamin) 1 1 7 10 
C5 L-Carnitine (Vitamin BT) | 1 0 6 2 
C6 Poly-N-vinylpyrrolidone | 1 0 4 Nil 
oy Inulin 11 0 12 10 
C8 Poly Cysteine 1 1 1 7 
C9 Chitosan 1 1 1 10 
C10 Pectin 3 0 6 10 
G11 Poly(propylene glycol) 1 0 4 4 
C12 Poly(propylene imine) 0 1 3 Nil 
C13 Poly __(lactic-co-glycolic | 2 0 2 8 
acid) 
c14 Deoxycholic acid 2 0 14 7 
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Table 3 


Physico-chemical properties of drug delivering molecules 


ADME Descriptors 
i Blood Brain Plasma 
Se NG: | Carers Aqueous : CYP 2D6 a Intestinal 4 
a barrier fo ds Hepatotoxicity : Protein 
Solubility ‘ Binding Absorption dye 
Penetration Binding 
1 C1 -9.057 4.0 0.277 0.509 3.0 0.0 
2 C2 -1.432 -1.229 0.069 0.264 0.0 0.0 
3 C3 -3.378 4.0 0.277 0.662 3.0 0.0 
4 C4 -1.335 -1.253 0.336 0.423 0.0 0.0 
5 CS 2.734 4.0 0.059 0.066 3.0 0.0 
6 C6 -0.550 4.0 0.059 0.324 1.0 0.0 
7 C7 0.894 4.0 0.108 0.0139 3.0 0.0 
8 C8 0.335 -1.362 0.059 0.033 0.0 0.0 
9 c9 1.736 4.0 0.029 0.013 3.0 0.0 
10 C10 1.595 4.0 0.029 0.059 3.0 0.0 
11 C11 0.907 -1.039 0.0590 0.059 0.0 0.0 
12 C12 -0.097 4.0 0.059 0.324 1.0 0.0 
13 C13 0.768 -1.713 0.059 0.006 0.0 0.0 
14 C14 -4.409 -0.154 0.485 0.026 0.0 1.0 
C1 - Vitamin B12 (cobalamin); C2 - Vitamin H (Biotin); C3 - Folic acid (Vitamin M / Vitamin B9); C4 - 
Vitamin B1 (Thiamin); C5 - L-Carnitine (Vitamin BT); C6 - Poly-N-vinylpyrrolidone; C7; Inulin; C8 - Poly 
Cysteine; C9 — Chitosan; C10 — Pectin; C11 - Poly(propylene glycol); C12 - Poly(propylene imine); C13 - Poly 
(lactic-co-glycolic acid); C14 - Deoxycholic acid 
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Table 4 


Toxicity studies for Drug delivering molecules by Discovery Studio and Toxtree 


Toxicity 
Toxicity Prediction by Discovery Studio Prediction 
by Toxtree 
Rat Skin Skin Aerobic Toxicity 
Rat oral Maximum | Irritancy | sensitization | Biodegradability 
S. é ee : LD50 tolerated 
Carriers Rodent Mutagenecity 
Ne Carcinogenicity (e/Ke eaey oes 
weight) (g/Kg Body 
weight) 

1 C1 Non- Non- 0.093 0.000 Mild- Non- Degradable High 
carcinogen mutagen Irritant sensitizer 

2 C2 Non- Non- 1.109 0.193 Non- Non- Degradable Low 
carcinogen mutagen Irritant sensitizer 

3 C3 Non- Non- 2.819 1.391 Non- Weak- Non- High 
carcinogen mutagen Irritant sensitizer Degradable 

4 C4 carcinogen Non- 1.308 0.097 Non- Strong- Non- Low 

mutagen Irritant sensitizer Degradable 

5 cs Non- Non- 1.101 0.175 Mild- Non- Degradable Low 
carcinogen mutagen Irritant sensitizer 

6 C6 carcinogen Non- 1.634 0.181 Mild- Non- Degradable High 

mutagen Irritant sensitizer 

7 C7 Non- Non- 20.789 0.000 Mild- Non- Degradable Low 
carcinogen mutagen Irritant sensitizer 

8 C8 Non- Non- 0.514 0.653 Non- Non- Degradable Low 
carcinogen mutagen Irritant sensitizer 

9 co Non- Non- 3.241 0.268 Mild- Non- Degradable High 
carcinogen mutagen Irritant sensitizer 

10 | C10 Non- Non- 3.576 0.525 Non- Non- Degradable Low 
carcinogen mutagen Irritant sensitizer 

11 | C11 Non- Non- 12.098 0.187 Mild- Non- Degradable High 
carcinogen mutagen Irritant sensitizer 

12 | C12 Non- Non- 0.055 0.089 Mild- Non- Non- High 
carcinogen mutagen Irritant sensitizer Degradable 

13 | C13 Non- Non- 2.982 0.427 Non- Non- Degradable Low 
carcinogen mutagen Irritant sensitizer 

14 | C14 Non- Non- 6.358 0.190207 Severe- | Weak- Degradable High 
carcinogen mutagen Irritant sensitizer 

FDA — Food & Drug Administration; 

C1 - Vitamin B12 (cobalamin); C2 - Vitamin H (Biotin); C3 - Folic acid (Vitamin M / Vitamin B9); C4 - Vitamin B1 (Thiamin); C5 

- L-Carnitine (Vitamin BT); C6 - Poly-N-vinylpyrrolidone; C7; Inulin; C8 - Poly Cysteine; C9 — Chitosan; C10 — Pectin; C11 - 

Poly(propylene glycol); C12 - Poly(propylene imine); C13 - Poly (lactic-co-glycolic acid); C14 - Deoxycholic acid 
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Table 5 


Conjugation results of Human Insulin Monomer (PDB ID: 1HLS), 


with all listed drug delivering molecules individually 


S.No Conjugate Binding site LIP DOCK Binding site LIP DOCK Other LIP DOCK 
(LYS B29) SCORE - (PHE B11) SCORE - Binding site SCORE 
C1 Vitamin B12 - - - - 
C2 CYSA11 
Vitamin H - - HIS B10 86.2835 
GLN B4 
C3 Vitamin M - - - - 
C4 CYS A6 
Vitamin B1 - - CYSA11 94.1144 
LEU B6 
cS ; : LEU A13 
Vitamin BT - - VAL B2 68.0592 
C6 poly-N- 
vinylpyrrolidone 7 : 7 7 
C7 Inulin PHE B1 117.663 - - 
C8 Poly Cysteine - - GLN B4 53.1597 
co CYS A11, 
Chitosan - - VAL B2, 74.4251 
ASN B3 
c10 PECTIN - - CYSA11 63.0902 
C11 POLY (PROPYLENE 
GLYCOL) - - VAL B2 76.0238 
C12 | poly(propylene 
imine) : : ” 7 
Gee 1) hely Uaetlcse: PHE B1 68.6737 CYS A11 68.6737 
glycolic acid) 
C14 | Deoxycholic acid - - - - 
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Table 6 


Conjugation results of Human insulin hexamer (PDB ID: 1AlO), 


with all listed drug delivering molecules individually 


S.No Conjugate Binding site LIP DOCK Binding site LIP DOCK Other LIP DOCK 
; Jus (LYS B29) SCORE - (PHE B1) SCORE - Binding site SCORE 
CL Vit B12 - - - - - 
‘ GLN B4 
C2 Vit H - - - ARG B22 103.231 
' GLY A1 
C3 Vit M - - - THR B27 114.324 
c4 Vit B1 LYS B29 79.8834 - - - 
cS Vit BT - - - ARG B22 84.7767 
Cy, Pon Se 3. - - : : : 
vinylpyrrolidone 
C7 Inulin LYS B29 94.3543 - - - 
: TYR B16 
C8 Poly Cysteine - - - GLU B21 58.5629 
TYR B16 
co Chitosan - - - TYR B26 90.016 
c10 PECTIN - - - aes 91.0549 
POLY (PROPYLENE GLU B21 
ca GLYCOL) ” . : ARG B22 phot 
C12 poly(propylene ; ; ; : ; 
imine) 
Poly (lactic-co- GLY B20 
te glycolic acid) 7 7 : ARG B22 Benes 
‘ : GLY A1 
C14 | Deoxycholic acid - - - ILE A2 60.763 
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Table 7 


Conjugation results of Insulin Lispro (PDB ID: 1 LPH), with all listed drug delivering molecules individually 


S.No Coniuigata Binding site LIP DOCK Binding site LIP DOCK Other LIP DOCK 
(LYS B28) SCORE - (PHE B1) SCORE - Binding site SCORE 
C1 Vit B12 - - - - 
C2 Vit H - - - - 
GLY A1 
ILE A2, 
VAL A3, 
GLU A4, 
C3 Vit M LYS B28 131.57 TYR A19, 131.57 
GLN B4 
HIS BS 
THR B27, 
THR B30 
c4 Vit B1 LYS B28 89.8971 GLU A4 89.8971 
; GLU A4, 
cS Vit BT - - GLN B4 58.6757 
ce. |Pev : : : : 
vinylpyrrolidone 
GLY A1 
C7 Inulin LYS B28 76.2195 VAL A3, 76.2195 
GLU A4 
; GLN B4 
C8 Poly Cysteine - - GLU AA 48.858 
GLY A1 
: ILE A2 
c9 Chitosan - - GLU A4 67.1639 
THR B27 
GLY A1 
C10 | PECTIN - - GLU A4 73.2077 
POLY (PROPYLENE 
C11 GLYCOL) - - GLY A1 57.6819 
C12 poly(propylene : ; ; f 
imine) 
GLY A1, 
ILE A2, 
Poly (lactic-co- sean fit 
C13 sivcdlie aca) - - VAL A3, 64.8442 
THR B27, 
GLN 44 
C14 | Deoxycholic acid - - - - 


Table 8 
Interaction results of Oral insulin conjugates (OIC) with Insulin Receptor (IR) 
Interaction with leucine-rich | Interaction with C-terminus 
S. No Conjugates repeat domain (L1, residues of the a-chain (aCT, 
1-157) residues 704-715) 
1 Paulin Menomen (HES 2PM No Interaction No Interaction 
Conjugates 
> Insulin Hexamer (1AI0)- DDM ARG86 NO iteracica 
Conjugates ASN34 
ae ARG86 
3 Sees GEE) RRM ASN9O No Interaction 
ARG114 
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Supplementary figure 1 

Pharmacophoric principles of Drug delivering molecules are illustrated by Discovery Studio software. Acceptors (Green in color), 
Donors (Magenta in color) of Pharmacophoric principles of all Drug delivering molecules are demonstrated and differentiated by 
color. C1 - Vitamin B12 (cobalamin); C2 - Vitamin H (Biotin); C3 - Folic acid (Vitamin M / Vitamin B9); C4 - Vitamin B1 (Thiamin); 
C5 - L-Carnitine (Vitamin BT); C6 - Poly-N-vinylpyrrolidone; C7; Inulin; C8 - Poly Cysteine; C9 — Chitosan; C10 — Pectin; C11 - 
Poly(propylene glycol); C12 - Poly(propylene imine); C13 - Poly (lactic-co-glycolic acid); C14 - Deoxycholic acid 


Supplementary Figure (1a): C1 - Vitamin B12 (cobalamin) 
Number of Pharmacophoric principles — Nil 


Supplementary Figure (1b): C2 - Vitamin H (Biotin) 
Number of Pharmacophoric principles —6 
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Supplementary Figure (1c): C3 - Folic acid (Vitamin M / Vitamin B9) 
Number of Pharmacophoric principles —10 
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Supplementary Figure (1d): C4 - Vitamin B1 (Thiamin) 
Number of Pharmacophoric principles —10 
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Supplementary Figure (1e): C5 - L-Carnitine (Vitamin BT) 
Number of Pharmacophoric principles —2 


Supplementary Figure (1f): C6- Poly-N-vinylpyrrolidone 
Number of Pharmacophoric principles —Nil 
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Supplementary Figure (1g): C7- Inulin 
Number of Pharmacophoric principles —10 
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Supplementary Figure (1h): C8- Poly Cysteine 
Number of Pharmacophoric principles —7 
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Supplementary Figure (1i): C9 — Chitosan 
Number of Pharmacophoric principles —10 
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Supplementary Figure (1j): C10 — Pectin 
Number of Pharmacophoric principles —10 
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Supplementary Figure (1k): C11 — Poly(propylene glycol) 
Number of Pharmacophoric principles —4 


Supplementary Figure (1L): C12 — Poly(propylene imine) 
Number of Pharmacophoric principles —Nil 
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Supplementary Figure (1m): C13 — Poly (lactic-co-glycolic acid) 
Number of Pharmacophoric principles —8 
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Supplementary Figure (1n): C14 — Deoxycholic acid 
Number of Pharmacophoric principles —7 
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Supplementary figure 2 

ADME descriptors of Drug delivering molecules by Discovery Studio; C1 - Vitamin B12 (cobalamin); C2 - Vitamin H (Biotin); C3 - 
Folic acid (Vitamin M / Vitamin B9); C4 - Vitamin B1 (Thiamin); C5 - L-Carnitine (Vitamin BT); C6 - Poly-N-vinylpyrrolidone; C7; 
Inulin; C8 - Poly Cysteine; C9 — Chitosan; C10 — Pectin; C11 - Poly(propylene glycol); C12 - Poly(propylene imine); C13 - Poly 
(lactic-co-glycolic acid); C14 - Deoxycholic acid 


Supplementary figure 2(a)- C1 - Vitamin B12 
ADMET_AlogP98 vs. ADMET_PSA_2D 
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Supplementary figure 2(b)- C2 - Vitamin H (Biotin) 
ADMET_AlogP98 vs. ADMET_PSA_2D 
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Supplementary figure 2(c)- C3 - Folic acid (Vitamin M / Vitamin B9) 


ADMET_AlogP98 vs. ADMET_PSA_2D 
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Supplementary figure 2(d)- C4 - Vitamin B1 (Thiamin) 


ADMET_AlogP98 vs. ADMET_PSA_2D 
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Supplementary figure 2(e)- C5 - L-Carnitine (Vitamin BT) 
ADMET_AlogP98 vs. ADMET_PSA_2D 
8 


ADMET_AlogP98: 
Nu 


4 
-50 -25 i) 25 50 75 100 125 150 
ADMET_PSA_2D 


Supplementary figure 2(f)- C6 - Poly-N-vinylpyrrolidone 


ADMET_AlogP98 vs. ADMET_PSA_2D 
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Supplementary figure 2(g)- C7- Inulin 


ADMET_AlogP98 vs. ADMET_PSA_2D 
8 


ADMET_AlogP98 


=z 


ADMET_PSA_2D 


Supplementary figure 2(h)- C8 - Poly Cysteine 
ADMET_AlogP98 vs. ADMET_PSA_2D 
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Supplementary figure 2(i)- C9 — Chitosan 


ADMET_AlogP98 vs. ADMET_PSA_2D 
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Supplementary figure 2(j)- C10 — Pectin 
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Supplementary figure 2(k)- C11 - Poly(propylene glycol) 
ADMET_AlogP98 vs. ADMET_PSA_2D 
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Supplementary figure 2(I)- C12 - Poly(propylene imine) 


ADMET_AlogP98 vs. ADMET_PSA_2D 
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Supplementary figure 2(m)- C13 - Poly (lactic-co-glycolic acid) 


ADMET_AlogP98 vs. ADMET_PSA_2D 
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Supplementary figure 2(n)- C14 - Deoxycholic acid 
ADMET_AlogP98 vs. ADMET_PSA_2D 
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Supplementary figure 3 

Toxicity studies for Drug delivering molecules by Discovery Studio; C1 - Vitamin B12 (cobalamin); C2 - Vitamin H (Biotin); C3 - Folic acid 
(Vitamin M / Vitamin B9); C4 - Vitamin B1 (Thiamin); C5 - L-Carnitine (Vitamin BT); C6 - Poly-N-vinylpyrrolidone; C7; Inulin; C8 - 
Poly Cysteine; C9 — Chitosan; C10 — Pectin; C11 - Poly(propylene glycol); C12 - Poly(propylene imine); C13 - Poly (lactic-co-glycolic 


acid); C14 - Deoxycholic acid 
Supplementary figure 3(a)- C1 - Vitamin B12 (cobalamin) 


FDA Rodent Carcinogenicity 


Molecular Weight: 1355.36518 
ALogP: -0.442 

Rotatable Bonds: 16 
Acceptors: 16 

Donors: 9 


Model Prediction 


Prediction: Non-Carcinogen 
Probability: 0.230 
Enrichment: 0.717 


Bayesian Score: -1.075 


Prediction: Positive if the Bayesian score is above the estimated 
best cutoff value. 

Probability: The esimated probability that the sample is in the 
category 

Enrichment: An estimate of enrichment, that is, the increased 
likelihood (versus random) of this sample being in the category 
Bayesian Score: The standard Laplacian-modified Bayesian 
score. 


Mutagenecity 


ALogP: -0.442 
Rotatable Bonds: 16 
Acceptors: 16 


Enrichment: 0.053 


Bayesian Score: -21.617 


Prediction: Positive if the Bayesian score is above the estimated 
best cutoff value 

Probability: The esimated probability that the sample is in the 
category. 

Enrichment: An estimate of enrichment, that is, the increased 
likelihood (versus random) of this sample being in the category 
Bayesian Score: The standard Laplacian-modified Bayesian 
score. 
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Rat oral LD50 (g/Kg Body weight) Rat Maximum tolerated dose (g/Kg Body weight) 


C,,H,,CON,,0,,P 

Molecular Weight: 1355.36518 
ALogP: -0.442 ALOogP: -0.442 
Rotatable Bonds: 16 Rotatable Bonds: 16 
Acceptors: 16 Acceptors: 16 
Donors: 9 Donors: 9 


Model Prediction Model Prediction 


Prediction: 0.093 Prediction: 0.000 
Unit: g/kg_body_weight Unit: g/kg_body_weight 
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Skin Irritancy 


Rotatable Bonds: 16 
Acceptors: 16 
Donors: 9 


Bayesian Score: -10.755 


Prediction: Positive if the Bayesian score is above the estimated 
best cutoff value. 

Probability: The esimated probability that the sample is in the 
category. 

Enrichment: An estimate of enrichment, that is, the increased 
likelihood (versus random) of this sample being in the category. 
Bayesian Score: The standard Laplacian-modified Bayesian 
score. 


Skin sensitization 


Acceptors: 16 
Donors: 9 


Model Prediction 


Prediction: Non-Sensitizer 
Probability: 0.502 
Enrichment: 0.731 
Bayesian Score: -4.826 


Prediction: Positive if the Bayesian score is above the estimated 
best cutoff value. 

Probability: The esimated probability that the sample is in the 
category. 

Enrichment: An estimate of enrichment, that is, the increased 
likelihood (versus random) of this sample being in the category. 
Bayesian Score: The standard Laplacian-modified Bayesian 
score. 
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Aerobic Biodegradability 


Molecular Weight: 1355.36518 
ALogP: -0.442 

Rotatable Bonds: 16 
Acceptors: 16 

Donors: 9 


Model Prediction 


Prediction: Degradable 
Probability: 0.634 
Enrichment: 1.454 


Bayesian Score: 3.091 


Prediction: Positive if the Bayesian score is above the estimated 
best cutoff value. 

Probability: The esimated probability that the sample is in the 
category. 

Enrichment: An estimate of enrichment, that is, the increased 
likelihood (versus random) of this sample being in the category. 
Bayesian Score: The standard Laplacian-modified Bayesian 
score. 
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Supplementary figure 3(b)- C2 - Vitamin H (Biotin) 


FDA Rodent Carcinogenicity 


CoH, .N,0.8 

Molecular Weight: 244.31064 
ALogP: 0.67 

Rotatable Bonds: 5 


Model Prediction 


Prediction: Non-Carcinogen 
Probability: 0.435 
Enrichment: 0.845 


Bayesian Score: -2.888 


Prediction: Positive if the Bayesian score is above the estimated 
best cutoff value. 

Probability: The esimated probability that the sample is in the 
category 

Ennchment: An estimate of enrichment, that is, the increased 
likelihood (versus random) of this sample being in the category 
Bayesian Score: The standard Laplacian-modified Bayesian 
score. 
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C,oH,~N,O,S 

Molecular Weight: 244.31064 
ALogP: 0.67 

Rotatable Bonds: 5 


Model Prediction 
Prediction: Non-Mutagen 
Probability: 0.002 
Enrichment: 0.004 
Bayesian Score: -29.037 


Prediction: Positive if the Bayesian score is above the estimated 
best cutoff value. 

Probability: The esimated probability that the sample is in the 
category 

Enrichment: An estimate of enrichment, that is, the increased 
likelihood (versus random) of this sample being in the category 
Bayesian Score: The standard Laplacian-modified Bayesian 
score. 
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Rat oral LD50 (g/Kg Body weight) 


C.oH,N,O,S 

Molecular Weight: 244.31064 
ALogP: 0.67 

Rotatable Bonds: 5 


Model Prediction 
Prediction: 1.109 
Unit: g/kg_body_weight 
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Rat Maximum tolerated dose (g/Kg Body weight) 


CoH, ~N,0.S 

Molecular Weight: 244.31064 
ALogP: 0.67 

Rotatable Bonds: 5 


Model Prediction 
Prediction: 0.193 
Unit: g/kg_body_ weight 
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Skin Irritancy 


C,gH,,N,0,S 


Skin sensitization 


C,9H,,N,0,S 


Molecular Weight: 244.31064 Molecular Weight: 244.31064 


ALogP: 0.67 


ALogP: 0.67 


Rotatable Bonds: 5 Rotatable Bonds: 5 


Model Prediction Model Prediction 


Prediction: Non-Irritant Prediction: Non-Sensitizer 
Probability: 0.971 

Enrichment: 1.054 Enrichment: 0.412 
Bayesian Score: -0.769 Bayesian Score: -7.644 


Prediction: Positive if the Bayesian score is above the estimated | Prediction: Positive if the Bayesian score is above the estimated 


best cutoff value. 


best cutoff value. 


Probability: The esimated probability that the sample is in the Probability: The esimated probability that the sample is in the 


category 


category. 


Enrichment: An estimate of enrichment, that is, the increased Enrichment: An estimate of enrichment, that is, the increased 
likelihood (versus random) of this sample being in the category likelihood (versus random) of this sample being in the category 
Bayesian Score: The standard Laplacian-modified Bayesian Bayesian Score: The standard Laplacian-modified Bayesian 


score. 


score. 
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Aerobic Biodegradability 


C,oH,N,O,S 

Molecular Weight: 244.31064 
ALogP: 0.67 

Rotatable Bonds: 5 


Prediction: Degradable 
Probability: 0.759 
Enrichment: 1.739 


Bayesian Score: 6.104 


Prediction: Positive if the Bayesian score is above the estimated 
best cutoff value. 

Probability: The esimated probability that the sample is in the 
category 

Enrichment: An estimate of enrichment, that is, the increased 
likelihood (versus random) of this sample being in the category 
Bayesian Score: The standard Laplacian-modified Bayesian 
score. 
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Supplementary figure 3(c)- C3 - Folic acid (Vitamin M / Vitamin B9) 


FDA Rodent Carcinogenicity 


C.cH,.N>O, 

Molecular Weight: 441.39745 
ALogP: -0.232 

Rotatable Bonds: 9 


Model Prediction 


Prediction: Non-Carcinogen 
Probability: 0.210 
Enrichment: 0.655 


Bayesian Score: -4.767 


Prediction: Positive if the Bayesian score is above the estimated 
best cutoff value. 

Probability: The esimated probability that the sample is in the 
category 

Enrichment: An estimate of enrichment, that is, the increased 
likelihood (versus random) of this sample being in the category 
Bayesian Score: The standard Laplacian-modified Bayesian 
score. 


Mutagenecity 


C.cH,.NO, 

Molecular Weight: 441.39745 
ALogP: -0.232 

Rotatable Bonds: 9 


Model Prediction 
Prediction: Non-Mutagen 


Bayesian Score: -60.476 


Prediction: Positive if the Bayesian score is above the estimated 
best cutoff value. 

Probability: The esimated probability that the sample is in the 
category 

Enrichment: An estimate of enrichment, that is, the increased 
likelihood (versus random) of this sample being in the category 
Bayesian Score: The standard Laplacian-modified Bayesian 
score. 
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Rat oral LD50 (g/Kg Body weight) 


C.cH,N5O; 

Molecular Weight: 441.39745 
ALogP: -0.232 

Rotatable Bonds: 9 


Model Prediction 
Prediction: 2.819 
Unit: g/kg_body_weight 
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Rat Maximum tolerated dose (g/Kg Body weight) 


C.cH,N5O; 

Molecular Weight: 441.39745 
ALogP: -0.232 

Rotatable Bonds: 9 


Model Prediction 
Prediction: 1.391 
Unit: g/kg_body_ weight 
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Skin Irritancy Skin sensitization 


C,H,.N>O; C.cH,.NO, 
Molecular Weight: 441.39745 Molecular Weight: 441.39745 
ALogP: -0.232 ALogP: -0.232 


Rotatable Bonds: 9 


Prediction: Non-Irritant 

Probability: 0.942 

Enrichment: 1.023 Enrichment: 1.035 
Bayesian Score: -1.988 Bayesian Score: -2.527 


Prediction: Positive if the Bayesian score is above the estimated Prediction: Positive if the Bayesian score is above the estimated 
best cutoff value. best cutoff value. 

Probability: The esimated probability that the sample is in the Probability: The esimated probability that the sample is in the 
category category 

Ennchment: An estimate of enrichment, that is, the increased Ennchment: An estimate of enrichment, that is, the increased 
likelihood (versus random) of this sample being in the category. likelihood (versus random) of this sample being in the category. 
Bayesian Score: The standard Laplacian-modified Bayesian Bayesian Score: The standard Laplacian-modified Bayesian 
score. score. 
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Aerobic Biodegradability 


Chere’ 
OHO 


C.H,.N>O; 
Molecular Weight: 441.39745 
ALogP: -0.232 


Prediction: Non-Degradable 
Probability: 0.267 
Enrichment: 0.612 


Bayesian Score: -5.103 


Prediction: Positive if the Bayesian score is above the estimated 
best cutoff value. 

Probability: The esimated probability that the sample is in the 
category 

Enrichment: An estimate of enrichment, that is, the increased 
likelihood (versus random) of this sample being in the category 
Bayesian Score: The standard Laplacian-modified Bayesian 
score. 
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Supplementary figure 3(d)- C4 - Vitamin B1 (Thiamin) 


FDA Rodent Carcinogenicity 


C,,H,,N,OS 

Molecular Weight: 265.35458 
ALogP: 1.048 

Rotatable Bonds: 4 


Bayesian Score: -0.319 


Prediction: Positive if the Bayesian score is above the estimated 
best cutoff value. 

Probability: The esimated probability that the sample is in the 
category 

Enrichment: An estimate of enrichment, that is, the increased 
likelihood (versus random) of this sample being in the category 
Bayesian Score: The standard Laplacian-modified Bayesian 
score. 
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Mutagenecity 


C,,H,,N,OS 

Molecular Weight: 265.35458 
ALogP: 1.048 

Rotatable Bonds: 4 


Model Prediction 
Prediction: Non-Mutagen 
Probability: 0.274 
Enrichment: 0.490 
Bayesian Score: -12.406 


Prediction: Positive if the Bayesian score is above the estimated 
best cutoff value. 

Probability: The esimated probability that the sample is in the 
category 

Enrichment: An estimate of enrichment, that is, the increased 
likelihood (versus random) of this sample being in the category. 
Bayesian Score: The standard Laplacian-modified Bayesian 
score. 
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Rat oral LD50 (g/Kg Body weight) Rat Maximum tolerated dose (g/Kg Body weight) 


C,,H,,N,OS C,,H,,N,OS 

Molecular Weight: 265.35458 Molecular Weight: 265.35458 
ALogP: 1.048 

Rotatable Bonds: 4 


Model Prediction Model Prediction 
Prediction: 1.308 Prediction: 0.097 
Unit: g/kg_body_ weight Unit: g/kg_body_ weight 
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Skin Irritancy 


C,.H,,N,OS 
Molecular Weight: 265.35458 
ALogP: 1.048 

Rotatable Bonds: 4 


Model Prediction 
Prediction: Non-Iritant 


Bayesian Score: -1.251 


Skin sensitization 


C,,H,,N,OS 

Molecular Weight: 265.35458 
ALogP: 1.048 

Rotatable Bonds: 4 


Model Prediction 
Prediction: Strong-Sensitizer 


Bayesian Score: 1.059 


Prediction: Positive if the Bayesian score is above the estimated | |Prediction: Positive if the Bayesian score is above the estimated 
best cutoff value. best cutoff value. 

Probability: The esimated probability that the sample is in the Probability: The esimated probability that the sample is in the 
category. category 

Enrichment: An estimate of enrichment, that is, the increased Enrichment: An estimate of enrichment, that is, the increased 
likelihood (versus random) of this sample being in the category likelihood (versus random) of this sample being in the category 
Bayesian Score: The standard Laplacian-modified Bayesian Bayesian Score: The standard Laplacian-modified Bayesian 
score. score. 
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Aerobic Biodegradability 


C.H,;N,OS 

Molecular Weight: 265.35458 
ALogP: 1.048 

Rotatable Bonds: 4 


Model Prediction 


Prediction: Non-Degradable 
Probability: 0.208 
Enrichment: 0.478 
Bayesian Score: -6.865 


Prediction: Positive if the Bayesian score is above the estimated 
best cutoff value. 

Probability: The esimated probability that the sample is in the 
category 

Ennchment: An estimate of enrichment, that is, the increased 
likelihood (versus random) of this sample being in the category. 
Bayesian Score: The standard Laplacian-modified Bayesian 
score. 
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Supplementary figure 3(e)- C5 - L-Carnitine (Vitamin BT) 


FDA Rodent Carcinogenicity 


C;H,,NO, 

Molecular Weight: 161.19889 
ALogP: -3.29 

Rotatable Bonds: 4 


Model Prediction 


Prediction: Non-Carcinogen 
Probability: 0.218 
Enrichment: 0.681 


Bayesian Score: -2.327 


Prediction: Positive if the Bayesian score is above the estimated 
best cutoff value. 

Probability: The esimated probability that the sample is in the 
category 

Ennchment: An estimate of enrichment, that is, the increased 
likelihood (versus random) of this sample being in the category. 
Bayesian Score: The standard Laplacian-modified Bayesian 
score. 


Mutagenecity 


C;H,,NO, 

Molecular Weight: 161.19889 
ALogP: -3.29 

Rotatable Bonds: 4 


Model Prediction 
Prediction: Non-Mutagen 
Probability: 0.543 
Enrichment: 0.972 
Bayesian Score: -6.539 


Prediction: Positive if the Bayesian score is above the estimated 
best cutoff value. 

Probability: The esimated probability that the sample is in the 
category 

Enrichment: An estimate of enrichment, that is, the increased 
likelihood (versus random) of this sample being in the category 
Bayesian Score: The standard Laplacian-modified Bayesian 
score. 
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Rat oral LD50 (g/Kg Body weight) 


C,H,.NO, 

Molecular Weight: 161.19889 
ALogP: -3.29 

Rotatable Bonds: 4 


Model Prediction 
Prediction: 1.101 
Unit: g/kg_body_weight 
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Rat Maximum tolerated dose (g/Kg Body weight) 


CjH,.NO, 
Molecular Weight: 161.19889 
ALogP: -3.29 


Model Prediction 
Prediction: 0.175 
Unit: g/kg_body weight 


Pave |. 85 


Skin Irritancy 


C;H,.NO, 
Molecular Weight: 161.19889 
ALogP: -3.29 


Skin sensitization 


C;H,,NO, 
Molecular Weight: 161.19889 
ALogP: -3.29 


Rotatable Bonds: 4 


Model Prediction 

Prediction: Non-Sensitizer 

Probability: 0.636 
Enrichment: 0.462 Enrichment: 0.926 
Bayesian Score: -6.587 Bayesian Score: -2.950 


Prediction: Positive if the Bayesian score is above the estimated Prediction: Positive if the Bayesian score is above the estimated 
best cutoff value. best cutoff value. 

Probability: The esimated probability that the sample is in the Probability: The esimated probability that the sample is in the 
category category. 

Enrichment: An estimate of enrichment, that is, the increased Enrichment: An estimate of enrichment, that is, the increased 
likelihood (versus random) of this sample being in the category. likelihood (versus random) of this sample being in the category 
Bayesian Score: The standard Laplacian-modified Bayesian Bayesian Score: The standard Laplacian-modified Bayesian 
score. score. 
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Aerobic Biodegradability 


C5H,.NO, 

Molecular Weight: 161.19889 
ALogP: -3.29 

Rotatable Bonds: 4 


Model Prediction 


Prediction: Degradable 
Probability: 0.621 
Enrichment: 1.422 
Bayesian Score: 2.788 


Prediction: Positive if the Bayesian score is above the estimated 
best cutoff value. 

Probability: The esimated probability that the sample is in the 
category 

Ennchment: An estimate of enrichment, that is, the increased 
likelihood (versus random) of this sample being in the category. 
Bayesian Score: The standard Laplacian-modified Bayesian 
score. 
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Supplementary figure 3(f)- C6 - Poly-N-vinylpyrrolidone 


FDA Rodent Carcinogenicity 


C,H,*,NO 

Molecular Weight: 111.14176 
ALogP: 6.1e-002 

Rotatable Bonds: 2 


Model Prediction 


Prediction: Carcinogen 
Probability: 0.256 
Enrichment: 0.798 


Bayesian Score: 0.693 


Prediction: Positive if the Bayesian score is above the estimated 
best cutoff value. 

Probability: The esimated probability that the sample is in the 
category 

Enrichment: An estimate of enrichment, that is, the increased 
likelihood (versus random) of this sample being in the category. 
Bayesian Score: The standard Laplacian-modified Bayesian 
score. 


Mutagenecity 


C,H,*,NO 

Molecular Weight: 111.14176 
ALogP: 6.1e-002 

Rotatable Bonds: 2 


Model Prediction 
Prediction: Non-Mutagen 
Probability: 0.494 
Enrichment: 0.885 
Bayesian Score: -7.677 


Prediction: Positive if the Bayesian score is above the estimated 
best cutoff value. 

Probability: The esimated probability that the sample ts in the 
category 

Enrichment: An estimate of enrichment, that is, the increased 
likelihood (versus random) of this sample being in the category 
Bayesian Score: The standard Laplacian-modified Bayesian 
score. 
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Rat oral LD50 (g/Kg Body weight) 


Molecular Weight: 111.14176 
ALogP: 6.1e-002 
Rotatable Bonds: 2 


Model Prediction 
Prediction: 1.634 
Unit: g/kg_body_ weight 


Rat Maximum tolerated dose (g/Kg Body weight) 


Molecular Weight: 111.14176 
ALogP: 6.1¢e-002 
Rotatable Bonds: 2 


Model Prediction 
Prediction: 0.181 
Unit: g/kg_body_weight 
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Skin Irritancy Skin sensitization 


C,H,*,NO C,H,*,NO 

Molecular Weight: 111.14176 Molecular Weight: 111.14176 
ALogP: 6.1e-002 
Rotatable Bonds: 2 


Model Prediction 
Prediction: Non-Sensitizer 


Bayesian Score: -2.583 Bayesian Score: -6.913 


Prediction: Positive if the Bayesian score is above the estimated Prediction: Positive if the Bayesian score is above the estimated 
best cutoff value. best cutoff value. 

Probability: The esimated probability that the sample is in the Probability: The esimated probability that the sample is in the 
category category 

Ennchment: An estimate of enrichment, that is, the increased Ennchment: An estimate of enrichment, that is, the increased 
likelihood (versus random) of this sample being in the category likelihood (versus random) of this sample being in the category 
Bayesian Score: The standard Laplacian-modified Bayesian Bayesian Score: The standard Laplacian-modified Bayesian 
score. score. 
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Aerobic Biodegradability 


C,H,*,NO 

Molecular Weight: 111.14176 
ALogP: 6.1e-002 

Rotatable Bonds: 2 


Model Prediction 
Prediction: Degradable 


Bayesian Score: 4.656 


Prediction: Positive if the Bayesian score is above the estimated 
best cutoff value. 

Probability: The esimated probability that the sample is in the 
category 

Ennchment: An estimate of enrichment, that is, the increased 
likelihood (versus random) of this sample being in the category 
Bayesian Score: The standard Laplacian-modified Bayesian 
score. 
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Supplementary figure 3(g)- C7; Inulin 


FDA Rodent Carcinogenicity 


OH 


C.2H0 1, 
Molecular Weight: 342.29648 


Prediction: Non-Carcinogen 
Probability: 0.216 
Enrichment: 0.673 
Bayesian Score: -2.679 


Prediction: Positive if the Bayesian score is above the estimated 
best cutoff value. 

Probability: The esimated probability that the sample is in the 
category 

Enrichment: An estimate of enrichment, that is, the increased 
likelihood (versus random) of this sample being in the category 
Bayesian Score: The standard Laplacian-modified Bayesian 
score. 


Mutagenecity 


OH 


C.2H9 1, 
Molecular Weight: 342.29648 


Prediction: Non-Mutagen 
Probability: 0.322 
Enrichment: 0.577 


Bayesian Score: -11.346 


Prediction: Positive if the Bayesian score is above the estimated 
best cutoff value. 

Probability: The esimated probability that the sample is in the 
category. 

Enrichment: An estimate of enrichment, that is, the increased 
likelihood (versus random) of this sample being in the category 
Bayesian Score: The standard Laplacian-modified Bayesian 
score. 
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Rat oral LD50 (g/Kg Body weight) 


OH 


= O 
OH ~—OH 


C.2H,.9), 

Molecular Weight: 342.29648 
ALogP: -4.361 

Rotatable Bonds: 6 


Model Prediction 
Prediction: 20.789 
Unit: g/kg_body_weight 
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Rat Maximum tolerated dose (g/Kg Body weight) 


C.2H294, 

Molecular Weight: 342.29648 
ALogP: -4.361 

Rotatable Bonds: 6 


Model Prediction 
Prediction: 0.000 
Unit: g/kg_body_weight 
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Skin Irritancy 


OH ~-~OH 


C.3H220,, 

Molecular Weight: 342.29648 
ALogP: -4.361 

Rotatable Bonds: 6 


Bayesian Score: -6.949 


Prediction: Positive if the Bayesian score is above the estimated 
best cutoff value. 

Probability: The esimated probability that the sample is in the 
category 

Ennchment: An estimate of enrichment, that is, the increased 
likelihood (versus random) of this sample being in the category. 
Bayesian Score: The standard Laplacian-modified Bayesian 
score. 


Skin sensitization 


OH ~-~OH 


C.3H20,, 

Molecular Weight: 342.29648 
ALogP: -4.361 

Rotatable Bonds: 6 


Model Prediction 
Prediction: Non-Sensitizer 
Probability: 0.670 
Enrichment: 0.977 
Bayesian Score: -2.394 


Prediction: Positive if the Bayesian score is above the estimated 
best cutoff value. 

Probability: The esimated probability that the sample is in the 
category 

Ennchment: An estimate of enrichment, that is, the increased 
likelihood (versus random) of this sample being in the category. 
Bayesian Score: The standard Laplacian-modified Bayesian 
score. 
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Aerobic Biodegradability 


OH 


C.2H220,, 

Molecular Weight: 342.29648 
ALogP: -4.361 

Rotatable Bonds: 6 


Prediction: Degradable 
Probability: 0.609 
Enrichment: 1.395 


Bayesian Score: 2.532 


Prediction: Positive if the Bayesian score is above the estimated 
best cutoff value. 

Probability: The esimated probability that the sample is in the 
category 

Ennchment: An estimate of enrichment, that is, the increased 
likelihood (versus random) of this sample being in the category 
Bayesian Score: The standard Laplacian-modified Bayesian 
score. 
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Supplementary figure 3(h)- C8 - Poly Cysteine 


FDA Rodent Carcinogenicity 


C,H,NO,S 

Molecular Weight: 121.15818 
ALogP: -3.078 

Rotatable Bonds: 2 


Model Prediction 


Prediction: Non-Carcinogen 
Probability: 0.210 
Enrichment: 0.656 


Bayesian Score: -4.229 


Prediction: Positive if the Bayesian score is above the estimated 
best cutoff value. 

Probability: The esimated probability that the sample is in the 
category 

Ennchment: An estimate of enrichment, that is, the increased 
likelihood (versus random) of this sample being in the category 
Bayesian Score: The standard Laplacian-modified Bayesian 
score. 
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Mutagenecity 


C,H,NO,S 

Molecular Weight: 121.15818 
ALogP: -3.078 

Rotatable Bonds: 2 


Model Prediction 
Prediction: Non-Mutagen 
Probability: 0.631 
Enrichment: 1.130 
Bayesian Score: -4.187 


Prediction: Positive if the Bayesian score is above the estimated 
best cutoff value. 

Probability: The esimated probability that the sample is in the 
category 

Enrichment: An estimate of enrichment, that is, the increased 
likelihood (versus random) of this sample being in the category 
Bayesian Score: The standard Laplacian-modified Bayesian 
score. 
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Rat oral LD50 (g/Kg Body weight) 


C,H,NO,S 

Molecular Weight: 121.15818 
ALogP: -3.078 

Rotatable Bonds: 2 


Model Prediction 
Prediction: 0.514 
Unit: g/kg_body_weight 


Rat Maximum tolerated dose (g/Kg Body weight) 


C,H,NO,S 

Molecular Weight: 121.15818 
ALogP: -3.078 

Rotatable Bonds: 2 


Model Prediction 
Prediction: 0.653 
Unit: g/kg_body_weight 
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Skin Irritancy 


C,H,NO,S 
Molecular Weight: 121.15818 
ALogP: -3.078 


Prediction: Non-Irritant 
Probability: 0.971 
Enrichment: 1.055 
Bayesian Score: -0.704 


Prediction: Positive if the Bayesian score is above the estimated 
best cutoff value. 

Probability: The esimated probability that the sample is in the 
category 

Ennchment: An estimate of enrichment, that is, the increased 
likelihood (versus random) of this sample being in the category 
Bayesian Score: The standard Laplacian-modified Bayesian 
score. 
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Skin sensitization 


C,H,NO,S 
Molecular Weight: 121.15818 
ALogP: -3.078 


Prediction: Non-Sensitizer 
Probability: 0.725 
Enrichment: 1.057 
Bayesian Score: -1.390 


Prediction: Positive if the Bayesian score is above the estimated 
best cutoff value. 

Probability: The esimated probability that the sample is in the 
category 

Enrichment: An estimate of enrichment, that is, the increased 
likelihood (versus random) of this sample being in the category 
Bayesian Score: The standard Laplacian-modified Bayesian 
score. 
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Aerobic Biodegradability 


C,H,NO,S 
Molecular Weight: 121.15818 
ALogP: -3.078 


Prediction: Degradable 
Probability: 0.562 
Enrichment: 1.289 


Bayesian Score: 1.548 


Prediction: Positive if the Bayesian score is above the estimated 
best cutoff value. 

Probability: The esimated probability that the sample is in the 
category 

Ennchment: An estimate of enrichment, that is, the increased 
likelihood (versus random) of this sample being in the category 
Bayesian Score: The standard Laplacian-modified Bayesian 
score. 
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Supplementary figure 3(i)- C9 — Chitosan 


FDA Rodent Carcinogenicity 


. OH Chiral 


oa. 4 


NH? 
C,H,,NO, 
Molecular Weight: 179.17112 


ALogP: -2.804 
Rotatable Bonds: 1 


Model Prediction 
Prediction: Non-Carcinogen 
Probability: 0.215 
Enrichment: 0.672 
Bayesian Score: -2.754 


Mutagenecity 


. OH Chiral 


« a 


“ou 


NH? 
C-H,.NO, 
Molecular Weight: 179.17112 


ALogP: -2.804 
Rotatable Bonds: 1 


Model Prediction 
Prediction: Non-Mutagen 
Probability: 0.711 
Enrichment: 1.273 
Bayesian Score: -1.342 


Prediction: Positive if the Bayesian score is above the estimated Prediction: Positive if the Bayesian score is above the estimated 
best cutoff value. best cutoff value. 

Probability: The esimated probability that the sample is in the Probability: The esimated probability that the sample is in the 
category category 

Enrichment: An estimate of enrichment, that is, the increased Enrichment: An estimate of enrichment, that is, the increased 
likelihood (versus random) of this sample being in the category. likelihood (versus random) of this sample being in the category 
Bayesian Score: The standard Laplacian-modified Bayesian Bayesian Score: The standard Laplacian-modified Bayesian 
score. score. 
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Rat oral LD50 (g/Kg Body weight) 


NH» 


C.H,,NO, 
Molecular Weight: 179.17112 
ALogP: -2.804 


Unit: g/kg_body_ weight 


Rat Maximum tolerated dose (g/Kg Body weight) 


NH? 
C,H,,NO, 
Molecular Weight: 179.17112 
ALogP: -2.804 


Model Prediction 
Prediction: 0.268 
Unit: g/kg_body_weight 
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Skin Irritancy Skin sensitization 


NH2 NH2 
C,H,,NO, C,H,,NO, 


Molecular Weight: 179.17112 
ALogP: -2.804 
Rotatable Bonds: 1 


Molecular Weight: 179.17112 
ALogP: -2.804 
Rotatable Bonds: 1 


Model Prediction 

Prediction: Non-Sensitizer 

Probability: 0.631 
Enrichment: 0.659 Enrichment: 0.920 
Bayesian Score: -4.415 Bayesian Score: -3.018 


Prediction: Positive if the Bayesian score is above the estimated | |Prediction: Positive if the Bayesian score is above the estimated 
best cutoff value. best cutoff value. 

Probability: The esimated probability that the sample is in the Probability: The esimated probability that the sample is in the 
category category 

Enrichment: An estimate of enrichment, that is, the increased Enrichment: An estimate of enrichment, that is, the increased 
likelihood (versus random) of this sample being in the category. | |likelihood (versus random) of this sample being in the category 
Bayesian Score: The standard Laplacian-modified Bayesian Bayesian Score: The standard Laplacian-modified Bayesian 
score. score. 
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Paez 0) 2 


Aerobic Biodegradability 


“oO 


“ou 


NH» 


C.H,,NO, 

Molecular Weight: 179.17112 
ALogP: -2.804 

Rotatable Bonds: 1 
Acceptors: 6 

Donors: 5 


Model Prediction 
Prediction: Degradable 


Bayesian Score: 2.775 


Prediction: Positive if the Bayesian score is above the estimated 
best cutoff value. 

Probability: The esimated probability that the sample is in the 
category 

Ennchment: An estimate of enrichment, that is, the increased 
likelihood (versus random) of this sample being in the category 
Bayesian Score: The standard Laplacian-modified Bayesian 
score. 
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Supplementary figure 3(j)- C10 — Pectin 


FDA Rodent Carcinogenicity 


OH 
C2H,.0, 
Molecular Weight: 194.1394 


ALogP: -2.386 
Rotatable Bonds: 1 


Model Prediction 


Prediction: Non-Carcinogen 
Probability: 0.213 
Enrichment: 0.666 


Bayesian Score: -3.131 


Prediction: Positive if the Bayesian score is above the estimated 
best cutoff value. 

Probability: The esimated probability that the sample is in the 
category 

Ennchment: An estimate of enrichment, that is, the increased 
likelihood (versus random) of this sample being in the category 
Bayesian Score: The standard Laplacian-modified Bayesian 
score. 


Mutagenecity 


C,H,.0, 

Molecular Weight: 194.1394 
ALogP: -2.386 

Rotatable Bonds: 1 


Model Prediction 
Prediction: Non-Mutagen 


Bayesian Score: -5.935 


Prediction: Positive if the Bayesian score is above the estimated 
best cutoff value. 

Probability: The esimated probability that the sample is in the 
category 

Enrichment: An estimate of enrichment, that is, the increased 
likelihood (versus random) of this sample being in the category 
Bayesian Score: The standard Laplacian-modified Bayesian 
score. 
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Rat oral LD50 (g/Kg Body weight) 


C,H,.0, 

Molecular Weight: 194.1394 
ALogP: -2.386 

Rotatable Bonds: 1 


Model Prediction 
Prediction: 0.525 
Unit: g/kg_body_ weight 
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Rat Maximum tolerated dose (g/Kg Body weight) 


C.H,0, 

Molecular Weight: 194.1394 
ALogP: -2.386 

Rotatable Bonds: 1 


Model Prediction 
Prediction: 3.576 
Unit: g/kg_body_weight 


Pavez, 0) 5 


Skin Irritancy 


OH 


C.H,.0; 
Molecular Weight: 194.1394 
ALogP: -2.386 


Prediction: Non-Irritant 
Probability: 0.971 
Enrichment: 1.055 
Bayesian Score: -0.726 


Prediction: Positive if the Bayesian score is above the estimated 
best cutoff value. 

Probability: The esimated probability that the sample is in the 
category 

Enrichment: An estimate of enrichment, that is, the increased 
likelihood (versus random) of this sample being in the category 
Bayesian Score: The standard Laplacian-modified Bayesian 
score. 
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Skin sensitization 


C2H,.0, 
Molecular Weight: 194.1394 
ALogP: -2.386 


Prediction: Non-Sensitizer 
Probability: 0.704 
Enrichment: 1.026 
Bayesian Score: -1.792 


Prediction: Positive if the Bayesian score is above the estimated 
best cutoff value. 

Probability: The esimated probability that the sample is in the 
category 

Enrichment: An estimate of enrichment, that is, the increased 
likelihood (versus random) of this sample being in the category 
Bayesian Score: The standard Laplacian-modified Bayesian 
score. 


PaseZ 0) 6 


Aerobic Biodegradability 


C.H,,0, 

Molecular Weight: 194.1394 
ALogP: -2.386 

Rotatable Bonds: 1 


Model Prediction 


Prediction: Degradable 
Probability: 0.632 
Enrichment: 1.447 
Bayesian Score: 3.026 


Prediction: Positive if the Bayesian score is above the estimated 
best cutoff value. 

Probability: The esimated probability that the sample is in the 
category. 

Enrichment: An estimate of enrichment, that is, the increased 
likelihood (versus random) of this sample being in the category 
Bayesian Score: The standard Laplacian-modified Bayesian 
score. 
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Paez (0) 7 


Supplementary figure 3(k)- C11 - Poly(propylene glycol) 


FDA Rodent Carcinogenicity 


C.H,402 
Molecular Weight: 134.17356 
ALogP: -0.274 


Rotatable Bonds: 4 


Prediction: Non-Carcinogen 
Probability: 0.222 
Enrichment: 0.692 


Bayesian Score: -1.865 


Prediction: Positive if the Bayesian score is above the estimated 
best cutoff value. 

Probability: The esimated probability that the sample is in the 
category 

Enrichment: An estimate of enrichment, that is, the increased 
likelihood (versus random) of this sample being in the category. 
Bayesian Score: The standard Laplacian-modified Bayesian 
score. 


© The Author(s) 2015. Open Access. This article is licensed under a Creative Commons Attribution License 4.0 (CC BY 4.0) 


Mutagenecity 


C2H,40, 
Molecular Weight: 134.17356 
ALogP: -0.274 


Prediction: Non-Mutagen 
Probability: 0.686 
Enrichment: 1.229 


Bayesian Score: -2.332 


Prediction: Positive if the Bayesian score is above the estimated 
best cutoff value. 

Probability: The esimated probability that the sample is in the 
category 

Ennchment: An estimate of enrichment, that is, the increased 
likelihood (versus random) of this sample being in the category. 
Bayesian Score: The standard Laplacian-modified Bayesian 
score. 


Paez 08 


Rat oral LD50 (g/Kg Body weight) 


C,H,,0, 

Molecular Weight: 134.17356 
ALogP: -0.274 

Rotatable Bonds: 4 


Model Prediction 
Prediction: 12.098 
Unit: g/kg_body_weight 
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Rat Maximum tolerated dose (g/Kg Body weight) 


CeH,40. 

Molecular Weight: 134.17356 
ALogP: -0.274 

Rotatable Bonds: 4 


Model Prediction 
Prediction: 0.187 
Unit: g/kg_body_weight 


pave Z 09 


Skin Irritancy 


CeH,492 
Molecular Weight: 134.17356 
ALogP: -0.274 


Enrichment: 0.664 
Bayesian Score: -4.358 


Prediction: Positive if the Bayesian score is above the estimated 
best cutoff value. 

Probability: The esimated probability that the sample is in the 
category 

Enrichment: An estimate of enrichment, that is, the increased 
likelihood (versus random) of this sample being in the category. 
Bayesian Score: The standard Laplacian-modified Bayesian 
score. 


Skin sensitization 


CeH,402 
Molecular Weight: 134.17356 
ALogP: -0.274 


Prediction: Non-Sensitizer 
Probability: 0.513 
Enrichment: 0.747 


Bayesian Score: -4.681 


Prediction: Positive if the Bayesian score is above the estimated 
best cutoff value. 

Probability: The esimated probability that the sample is in the 
category 

Enrichment: An estimate of enrichment, that is, the increased 
likelihood (versus random) of this sample being in the category 
Bayesian Score: The standard Laplacian-modified Bayesian 
score. 
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Paez, 10 


Aerobic Biodegradability 


C.H,,0, 

Molecular Weight: 134.17356 
ALogP: -0.274 

Rotatable Bonds: 4 


Model Prediction 


Prediction: Degradable 
Probability: 0.711 
Enrichment: 1.629 
Bayesian Score: 4.861 


Prediction: Positive if the Bayesian score is above the estimated 
best cutoff value. 

Probability: The esimated probability that the sample is in the 
category 

Ennchment: An estimate of enrichment, that is, the increased 
likelihood (versus random) of this sample being in the category 
Bayesian Score: The standard Laplacian-modified Bayesian 
score. 
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Paez, 1 1 


Supplementary figure 3(I)- C12 - Poly(propylene imine) 


FDA Rodent Carcinogenicity 


C,H,*,N 

Molecular Weight: 57.09438 
ALogP: 0.131 

Rotatable Bonds: 3 


Model Prediction 


Prediction: Non-Carcinogen 
Probability: 0.558 
Enrichment: 1.417 


Bayesian Score: -0.647 


Prediction: Positive if the Bayesian score is above the estimated 
best cutoff value. 

Probability: The esimated probability that the sample is in the 
category 

Enrichment: An estimate of enrichment, that is, the increased 
likelihood (versus random) of this sample being in the category 
Bayesian Score: The standard Laplacian-modified Bayesian 
score. 
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Mutagenecity 


C,H,*,N 

Molecular Weight: 57.09438 
ALogP: 0.131 

Rotatable Bonds: 3 


Model Prediction 
Prediction: Non-Mutagen 
Probability: 0.666 
Enrichment: 1.192 
Bayesian Score: -3.064 


Prediction: Positive if the Bayesian score is above the estimated 
best cutoff value. 

Probability: The esimated probability that the sample is in the 
category 

Enrichment: An estimate of enrichment, that is, the increased 
likelihood (versus random) of this sample being in the category 
Bayesian Score: The standard Laplacian-modified Bayesian 
score. 


Paez 12 


Rat oral LD50 (g/Kg Body weight) 


C,H,*,N 

Molecular Weight: 57.09438 
ALogP: 0.131 

Rotatable Bonds: 3 


Model Prediction 
Prediction: 0.055 
Unit: g/kg_body_weight 
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Rat Maximum tolerated dose (g/Kg Body weight) 


C,H,",N 

Molecular Weight: 57.09438 
ALogP: 0.131 

Rotatable Bonds: 3 


Model Prediction 
Prediction: 0.089 
Unit: g/kg_body_weight 


page, 1 3 


Skin Irritancy Skin sensitization 


C,H,*,N C,H,*,N 

Molecular Weight: 57.09438 Molecular Weight: 57.09438 
ALogP: 0.131 
Rotatable Bonds: 3 


Model Prediction 
Prediction: Non-Sensitizer 


Enrichment: 0.748 Enrichment: 0.993 
Bayesian Score: -3.519 Bayesian Score: -2.202 


Prediction: Positive if the Bayesian score is above the estimated Prediction: Positive if the Bayesian score is above the estimated 
best cutoff value. best cutoff value. 

Probability: The esimated probability that the sample is in the Probability: The esimated probability that the sample is in the 
category category 

Enrichment: An estimate of enrichment, that is, the increased Enrichment: An estimate of enrichment, that is, the increased 
likelihood (versus random) of this sample being in the category likelihood (versus random) of this sample being im the category. 
Bayesian Score: The standard Laplacian-modified Bayesian Bayesian Score: The standard Laplacian-modified Bayesian 
score. score. 
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Parvez. 14 


Aerobic Biodegradability 


C,H,*,N 

Molecular Weight: 57.09438 
ALogP: 0.131 

Rotatable Bonds: 3 


Model Prediction 
Prediction: Non-Degradable 


Bayesian Score: 0.529 


Prediction: Positive if the Bayesian score is above the estimated 
best cutoff value. 

Probability: The esimated probability that the sample is in the 
category 

Enrichment: An estimate of enrichment, that is, the increased 
likelihood (versus random) of this sample being in the category 
Bayesian Score: The standard Laplacian-modified Bayesian 
score. 
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pave Z, 1 5 


Supplementary figure 3(m)- C13 - Poly (lactic-co-glycolic acid) 


FDA Rodent Carcinogenicity 


C.H.O; 

Molecular Weight: 148.11402 
ALogP: -0.686 

Rotatable Bonds: 4 


Model Prediction 


Prediction: Non-Carcinogen 
Probability: 0.213 
Enrichment: 0.664 


Bayesian Score: -3.290 


Prediction: Positive if the Bayesian score is above the estimated 
best cutoff value. 

Probability: The esimated probability that the sample is in the 
category 

Ennchment: An estimate of enrichment, that is, the increased 
likelihood (versus random) of this sample being in the category. 
Bayesian Score: The standard Laplacian-modified Bayesian 
score. 
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Mutagenecity 


C.H.O; 

Molecular Weight: 148.11402 
ALogP: -0.686 

Rotatable Bonds: 4 


Model Prediction 
Prediction: Non-Mutagen 
Probability: 0.543 
Enrichment: 0.972 


Bayesian Score: -6.541 


Prediction: Positive if the Bayesian score is above the estimated 
best cutoff value. 

Probability: The esimated probability that the sample is in the 
category 

Enrichment: An estimate of enrichment, that is, the increased 
likelihood (versus random) of this sample being in the category 
Bayesian Score: The standard Laplacian-modified Bayesian 
score. 


Padez 16 


Rat oral LD50 (g/Kg Body weight) 


C.H,0. 

Molecular Weight: 148.11402 
ALogP: -0.686 

Rotatable Bonds: 4 


Model Prediction 
Prediction: 2.982 
Unit: g/kg_body_weight 
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Rat Maximum tolerated dose (g/Kg Body weight) 


C,H.O, 

Molecular Weight: 148.11402 
ALogP: -0.686 

Rotatable Bonds: 4 


Model Prediction 
Prediction: 0.427 
Unit: g/kg_body weight 


Paez 1 if 


Skin Irritancy 


C.H.O. 
Molecular Weight: 148.11402 
ALogP: -0.686 


Bayesian Score: -0.615 


Prediction: Positive if the Bayesian score is above the estimated 
best cutoff value. 

Probability: The esimated probability that the sample is in the 
category 

Enrichment: An estimate of enrichment, that is, the increased 
likelihood (versus random) of this sample being in the category 
Bayesian Score: The standard Laplacian-modified Bayesian 
score. 
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Skin sensitization 


C.H,O; 
Molecular Weight: 148.11402 
ALogP: -0.686 


Bayesian Score: -1.565 


Prediction: Positive if the Bayesian score is above the estimated 
best cutoff value. 

Probability: The esimated probability that the sample is in the 
category. 

Ennchment: An estimate of enrichment, that is, the increased 
likelihood (versus random) of this sample being in the category 
Bayesian Score: The standard Laplacian-modified Bayesian 
score. 


Paez, 18 


Aerobic Biodegradability 


C.H.0, 

Molecular Weight: 148.11402 
ALogP: -0.686 

Rotatable Bonds: 4 


Model Prediction 
Prediction: Degradable 


Bayesian Score: 5.017 


Prediction: Positive if the Bayesian score is above the estimated 
best cutoff value. 

Probability: The esimated probability that the sample is in the 
category 

Enrichment: An estimate of enrichment, that is, the increased 
likelihood (versus random) of this sample being in the category 
Bayesian Score: The standard Laplacian-modified Bayesian 
score. 
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Pave Z, 1 9 


Supplementary figure 3(n)- C14 - Deoxycholic acid 


FDA Rodent Carcinogenicity 


CoH gq, 

Molecular Weight: 392.57199 
ALogP: 4.082 

Rotatable Bonds: 4 


Model Prediction 
Prediction: Non-Carcinogen 


Bayesian Score: -1.366 


Prediction: Positive if the Bayesian score is above the estimated 
best cutoff value. 

Probability: The esimated probability that the sample is in the 
category 

Enrichment: An estimate of enrichment, that is, the increased 
likelihood (versus random) of this sample being in the category 
Bayesian Score: The standard Laplacian-modified Bayesian 
score. 
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Mutagenecity 


OH 


CoH yg, 

Molecular Weight: 392.57199 
ALogP: 4.082 

Rotatable Bonds: 4 


Model Prediction 
Prediction: Non-Mutagen 
Probability: 0.000 
Enrichment: 0.000 
Bayesian Score: -64.641 


Prediction: Positive if the Bayesian score is above the estimated 
best cutoff value. 

Probability: The esimated probability that the sample is in the 
category 

Ennchment: An estimate of enrichment, that is, the increased 
likelihood (versus random) of this sample being in the category 
Bayesian Score: The standard Laplacian-modified Bayesian 
score. 


Paez, 2 0 


Rat oral LD50 (g/Kg Body weight) 


CogH ype 

Molecular Weight: 392.57199 
ALogP: 4.082 

Rotatable Bonds: 4 


Model Prediction 
Prediction: 6.358 
Unit: g/kg_body_ weight 
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Rat Maximum tolerated dose (g/Kg Body weight) 


CoH yg 

Molecular Weight: 392.57199 
ALogP: 4.082 

Rotatable Bonds: 4 


Model Prediction 
Prediction: 0.190 
Unit: g/kg_body_ weight 


Pace 2 1 


Skin Irritancy 


OH > 


CoH gq, 

Molecular Weight: 392.57199 
ALogP: 4.082 

Rotatable Bonds: 4 


Model Prediction 
Prediction: Moderate_Severe 
Probability: 0.472 
Enrichment: 1.283 

Bayesian Score: 1.758 


Prediction: Positive if the Bayesian score is above the estimated 
best cutoff value. 

Probability: The esimated probability that the sample is in the 
category 

Ennchment: An estimate of enrichment, that is, the increased 
likelihood (versus random) of this sample being in the category. 
Bayesian Score: The standard Laplacian-modified Bayesian 
score. 
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Skin sensitization 


C4440, 

Molecular Weight: 392.57199 
ALogP: 4.082 

Rotatable Bonds: 4 


Bayesian Score: -8.518 


Prediction: Positive if the Bayesian score is above the estimated 
best cutoff value. 

Probability: The esimated probability that the sample is in the 
category 

Enrichment: An estimate of enrichment, that is, the increased 
likelihood (versus random) of this sample being in the category 
Bayesian Score: The standard Laplacian-modified Bayesian 
score. 


Pace Z Z 2 


Aerobic Biodegradability 


Cog ype 

Molecular Weight: 392.57199 
ALogP: 4.082 

Rotatable Bonds: 4 


Model Prediction 
Prediction: Degradable 


Bayesian Score: 8.551 


Prediction: Positive if the Bayesian score is above the estimated 
best cutoff value. 

Probability: The esimated probability that the sample is in the 
category 

Ennchment: An estimate of enrichment, that is, the increased 
likelihood (versus random) of this sample being in the category 
Bayesian Score: The standard Laplacian-modified Bayesian 
score. 
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Pave Z Z 3 


Supplementary figure 4 


Toxicity studies for Drug delivering molecules by Toxtree; C1 - Vitamin B12 (cobalamin); C2 - Vitamin H (Biotin); C3 - Folic acid (Vitamin 
M / Vitamin B9); C4 - Vitamin B1 (Thiamin); C5 - L-Carnitine (Vitamin BT); C6 - Poly-N-vinylpyrrolidone; C7; Inulin; C8 - Poly 
Cysteine; C9 — Chitosan; C10 — Pectin; C11 - Poly(propylene glycol); C12 - Poly(propylene imine); C13 - Poly (lactic-co-glycolic 


acid); C14 - Deoxycholic acid 


Supplementary figure 4(a)- C1 - Vitamin B12 (cobalamin) 


% Toxtree (Estimation of Toxic Hazard - A Decision Tree Approach) v2.6.0 


el) 


File Edit Chemical Compounds Toxic Hazard Method Help 


File: G:\VASANTH - Phd\Drug discovery - Insulin\1. Phd paper 1\Carriers - CheBi\1. Vitamin B12.mol 


Available structure attributes 

|High (Class III) 

|G: \VASANTH - Phd\prug di... 
itoxTree. tree.cramer.Cram... [IN,2N,3Y,4N 


Structure diagram 


Supplementary figure 4(b)- C2 - Vitamin H (Biotin) 


%\ Toxtree (Estimation of Toxic Hazard - A Decision Tree Approach) v2.6.0 


by Cramer rules 


Low (Class I) 


Intermediate (Class II) 


High (Class I) 


V| Verbose explanation 


Cramer rules 

&@ Q1 Normal constituent of the body No 

aa Q2.Contains functional groups associated with enhanced 
toxicity No 

&a Q3.Contains elements other than C.H.O.N.divalent S Yes 

8 Q4 Elements not listed in Q3 occurs only as a Na,K.Ca.Mg.N 
salt, sulphamate. sulphonate, sulphate, hydrochloride 

.-- No Class High (Class IIT) 


a) 


File Edit Chemical Compounds Toxic Hazard Method Help 


File: G:\VASANTH - Phd\Drug discovery - Insulin\1. Phd paper 1\Carriers - CheBi\2. Vitamin H.mol 


Available structure attributes 

Low (Class I) 

|G:\VASANTH -Phd\prug di... 
toxTree. tree.cramer.Cram...|1Y 


Structure diagram 


Toxic Hazard 


Low (Class I) 


Intermediate (Class 11) 


High (Class I) 


¥| Verbose explanation 


Cramer rules 


a8 Q1.Normal constituent of the body Yes Class Low (Class I) 
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ae 2 4 


Supplementary figure 4(c)- C3 - Folic acid (Vitamin M / Vitamin B9) 


% Toxtree (Estimation of Toxic Hazard - A Decision Tree Approach) v2.6.0 Loy) 


File Edit Chemical Compounds Toxic Hazard Method Help 


File: G: \VASANTH - Phd\Drug discovery - Insulin\1. Phd paper 1\Carriers - CheBi\3. Vitamin M.mol 


Available structure attributes Toxic Hazard 
|High (Class 111) | 
\G:\VASANTH - Phd\Drug di... 


itoxTree. tree.cramer.Cram... | 1N,2N,3N,5N,6N,7Y,8N, 1... 


Low (Class I) 
Intermediate (Class II) 


High (Class IT) 


Structure diagram 
¥| Verbose explanation 


Cramer rules 
& Q1.Normal constituent of the body No 
88 Q2. Contains functional groups associated with enhanced toxicity No 
#8 Q3.Contains elements other than C,H.O.N.divalent S No 
a Q5.Simply branched aliphatic hydrocarbon or a common carbohydrate No 
88 Q6 Benzene derivative with certain substituents No 
#8 Q7.Heterocyclic Yes 
&@ Q8 Lactone or cyclic diester No 
&8 Q10.3-membered heterocycle No 
a Q11.Has a heterocyclic ring with complex substituents. Yes 
G2 Q33 Has sufficient number of sulphonate or sulphamate groups No Class High (Class III 


Supplementary figure 4(d)- C4 - Vitamin B1 (Thiamin) 


¥_ Toxtree (Estimation of Toxic Hazard - A'Decision Tree Approach) v2.6.0 | |) |e Son) 
File Edit Chemical Compounds Toxic Hazard Method Help 


File: G:\VASANTH - Phd\Drug discovery - Insulin\1. Phd paper 1\Carriers - ChEMBL\4. Vitamin B1.mol 


Available structure attributes Toxic Hazard by Cramer rules 
Cramer rules |Low (Class I) | >) Estimate 
Names |G: \VASANTH - Phd\pru... 


toxTree. tree.cramer.C, os [EY 


Low (Class I) 


Intermediate (Class 11) 


High (Class IIT) 


Structure diagram 
¥| Verbose explanation 


Cramer rules 


&@ 1 Normal constituent of the body Yes Class Low (Class I) 
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Pace 2 5 


Supplementary figure 4(e)- C5 - 


L-Carnitine (Vitamin BT) 


\ Toxtree (Estimation of Toxic Hazard - A Decision Tree Approach) v2.6.0 | =o jes | 


File Edit Chemical Compounds Toxic Hazard Method Help 
File: G:\VASANTH - Phd\Drug discovery - Insulin\1. Phd paper 1\Carriers - ChEMBL\5. Vitamin BT.mol 


Available structure attributes Toxic Hazard by Cramer rules 


Cramer rules 
Names 


toxTree.tree.cramer.C... |1¥ 


Structure diagram 


Low (Class I) : ») Estimate 
(G: \VASANTH - Phd\pru... 


a Low (Class I) 


Intermediate (Class IT) 


High (Class II) 


¥| Verbose explanation 


Cramer rules 


rs) Q1.Normal constituent of the body Yes Class Low (Class I) 


Supplementary figure 4(f)- C6 - Poly-N-vinylpyrrolidone 


% Toxtree (Estimation of Toxic Hazard - A Decision Tree Approach) v2.6.0 (ecole) 


File Edit Chemical Compounds Toxic Hazard Method Help 
File: G:\VASANTH - Phd\Drug discovery - Insulin\1. Phd paper 1\Carriers - CheBi\6. poly-N-vinylpyrrolidone.mol 


Available structure attributes Toxic Hazard by Cramer rules 


Cramer rules 
Names 


toxTree. tree.cramer.C. 


Structure diagram 
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.. |IN,2N,3Y,4N 


iHigh (Class 111) | 
\G:\VASANTH - Phd\pru... | 


>) Estimate 


Low (Class I) 


Intermediate (Class 11) 


High (Class I) 


¥| Verbose explanation 


Cramer rules 

aa Q1-Normal constituent of the body No 

& Q2 Contains functional groups associated with enhanced toxicity No 

aa Q3.Contains elements other than C.H.O.N.divalent S$ Yes 

aa Q4.Elements not listed in Q3 occurs only as a Na.K,Ca.Mg.N salt, 
sulphamate, sulphonate, sulphate, hydrochloride ... No Class High (Class IT) 
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Supplementary figure 4(g)- C7; Inulin 


Ye Toxtree (Estimation of Toxic Hazard - A Decision Tree Approach) v2.6.0 (ecanlal=) mes | 


File Edit Chemical Compounds Toxic Hazard Method Help 


File: G:\VASANTH - Phd\Prug discovery - Insulin\1. Phd paper 1\7. Inulin 2 - zinc_4095703.sdf 


Available structure attributes Toxic Hazard 
|Low (Class 1) 
\ZINC 12358861 


itoxTree. tree.cramer.Cram... |1¥ Low (Class I) 


Intermediate (Class II) 


High (Class II) 


Structure diagram 
¥| Verbose explanation 


Cramer rules 


#8 Q1 Normal constituent of the body Yes Class Low (Class I) 


Supplementary figure 3(h)- C8 - Poly Cysteine 


\ Toxtree (Estimation of Toxic Hazard - A Decision Tree Approach) v2.6.0 =o 


File Edit Chemical Compounds Toxic Hazard Method Help 


File: G: \VASANTH - Phd\Drug discovery - Insulin\1. Phd paper 1\Carriers - Drug bank\g. PolyCysteine.mol 


Available structure attributes ’ by Cramer rules 
|Low (Class I) 
|G:\VASANTH - Phd\Dru... 
|151 

toxTree. tree.cramer.C...|1Y 


Low (Class I) 


Intermediate (Class 11) 


High (Class II) 


Structure diagram 


¥| Verbose explanation 


Cramer rules 


a8 Q1_.Normal constituent of the body Yes Class Low (Class I) 
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Supplementary figure 4(i)- C9 — Chitosan 


% Toxtree (Estimation of Toxic Hazard - A Decision Tree Approach) v2.6.0 moe _X_) 


File Edit Chemical Compounds Toxic Hazard Method Help 
File: G:\VASANTH - Phd\Drug discovery - Insulin\1. Phd paper 1\Carriers - Chemspider\9. Chitosan.mol 


Toxic Hazard by Cramer rules 


Available structure attributes 
|1/0/N:34,33,31,30,26... | >) Estimate 


|High (Class IIT) Low (Class I) 

[C18 H35N3 013 

501.4828 

Veen ee | Intermediate (Class II) 
jO1[ce@] (TH) (CCH)L.. 

InChI=1S/C 18H35N30... 

Rem AM | High (Class 1) 

439300 


itoxTree. tree.cramer.C...|1N,2N,3N,5N,6N,7Y,8... 
= er ¥| Verbose explanation 


Cramer rules 

&@ Q1 Normal constituent of the body No 

&@ Q? Contains functional groups associated with enhanced toxicity No 

&@ Q3.Contains elements other than C.H,O.N. divalent S No 

aa Q5.Simply branched aliphatic hydrocarbon or a common carbohydrate No 
aa Q6 Benzene derivative with certain substituents No 

8 Q7 Heterocyclic Yes 

&@ Q8 Lactone or cyclic diester No 

aa Q10.3-membered heterocycle No 

&@ O11 Has a heterocyclic ring with complex substituents. Yes 

aa Q33.Has sufficient number of sulphonate or sulphamate groups No Class Hish (Class II) 


Supplementary figure 4(j)- C10 — Pectin 


YX Toxtree (Estimation of Toxic Hazard - A Decision Tree Approach) v2.6.0 {| oh eS 
File Edit Chemical Compounds Toxic Hazard Method Help 


File: G:\VASANTH - Phd\Drug discovery - Insulin\1. Phd paper 1\Carriers - CheBi\10. Pectin.mol 
Available structure attributes Toxic Hazard by Cramer rules 
lLow (Class I) 
|G: \VASANTH - Phd\pru... 
toxTree. tree.cramer.C... |1¥ Low (Class I) 


Intermediate (Class 11) 


High (Class IT) 


¥| Verbose explanation 


Structure diagram Cramer rules 


a Q1.Normal constituent of the body Yes Class Low (Class I) 
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Supplementary figure 4(k)- C11 - Poly(propylene glycol) 


% Toxtree (Estimation of Toxic Hazard - A’Decision Tree Approach) v2.6.0 (el) 


File Edit Chemical Compounds Toxic Hazard Method Help 


File: G:\VASANTH - Phd\Drug discovery - Insulin\1. Phd paper 1\Carriers - CheBi\11. poly(propylene glycol) macromolecule .mol 


Available structure attributes Toxic Hazard by Cramer rules 
|High (Class IIT) | >) Estimate 
|G:\VASANTH - Phd)Dru... | 

toxTree. tree.cramer.C...|1N,2N,3N,5N,6N,7N,1... 


Low (Class I) 


Intermediate (Class IT) 


High (Class I) 


¥| Verbose explanation 


Cramer rules 


8 Q1 Normal constituent of the body No 


Structure diagram 


&a Q2.Contains functional groups associated with enhanced toxicity No 

aa Q3.Contains elements other than C.H.O.N. divalent S No 

& Q5.Simply branched aliphatic hydrocarbon or a common carbohydrate No 
& Q6_Benzene derivative with certain substituents No 

&@ Q7 Heterocyclic No 

& Q16.Common terpene No 

& Q17.Readily hydrolysed to a common terpene No 

G8 Q19 Open chain Yes 

#8 Q20 Aliphatic with some functional groups (see explanation) No 

i] Q22.Common component of food No 

#8 Q33.Has sufficient number of sulphonate or sulphamate groups No Class High (Class IIT) 


Supplementary figure 4(I)- C12 - Poly(propylene imine) 


% Toxtree (Estimation of Toxic Hazard - A Decision Tree Approach) v2.6.0 (ele) [pen 
File Edit Chemical Compounds Toxic Hazard Method Help 


File: G: \VASANTH - Phd\Drug discovery - Insulin\1. Phd paper 1\Carriers - CheBi\12. poly(propylene imine) macromolecule .mol 


Available structure attributes Toxic Hazard by Cramer rules 
Cramer rules |High (Class IIT) | >) Estimate 


Names |G:WASANTH -Phd\Dru. i 
itoxTree. tree.cramer.C...|1N,2N,3Y,4N 


Low (Class I) 


Intermediate (Class II) 


High (Class I) 


¥| Verbose explanation 


Cramer rules 

aa Q1-Normal constituent of the body No 

&a Q2.Contains functional groups associated with enhanced toxicity No 

&@ 3 Contains elements other than C.H.O.N.divalent S Yes 

a Q4.Elements not listed in Q3 occurs only as a Na.K.Ca.Mg.N salt, sulphamate. sulphonate. 
sulphate, hydrochloride ... No Class High (Class IT) 


Structure diagram 
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Supplementary figure 4(m)- C13 - Poly (lactic-co-glycolic acid) 


%_ Toxtree (Estimation of Toxic Hazard - A’Decision Tree Approach) v2.6.0 G=aEo) 


File Edit Chemical Compounds Toxic Hazard Method Help 
File: G:\VASANTH - Phd\Drug discovery - Insulin\1. Phd paper 1\Carriers - CheBi\13. Poly (lactic-co-glycolic acid) .mol 


Available structure attributes a a BE ee 
lLow (Class I) ») Estimate 
\G:\VASANTH - Phd\pru... 

toxTree. tree.cramer.C... | IN,2N,3N,5N,6N, 7N, 1... Low (Class I) 


Intermediate (Class II) 


High (Class I) 


V¥| Verbose explanation 


Structure diagram Cramer rules 


8 Q1 Normal constituent of the body No 


88 Q2 Contains functional groups associated with enhanced toxicity No 
t.] Q3.Contains elements other than C.H,O.N, divalent S No 

#8 Q5.Simply branched aliphatic hydrocarbon or a common carbohydrate No 
&8 Q6 Benzene derivative with certain substituents No 

88 Q7 Heterocyclic No 

88 Q16.Common terpene No 

aa Q17 Readily hydrolysed to a common terpene No 

&@ Q19 Open chain Yes 

#8 (Q20.Aliphatic with some functional groups (see explanation) Yes 
&8 Q21.3 or more different functional groups No 

#8 Q18.One of the list (see explanation) No Class Low (Class I) 


Supplementary figure 4(n)- C14 - Deoxycholic acid 


% Toxtree (Estimation of Toxic Hazard - A Decision Tree Approach) v2.6.0 | o/e 


File Edit Chemical Compounds Toxic Hazard Method Help 


File: G:\VASANTH - Phd\Drug discovery - Insulin\1. Phd paper 1\Carriers - CheBi\14. Deoxycholic acid .mol 

Available structure Toxic Hazard by Cramer rules 
attributes >) Estimate 

|High (Class III) 

G:\VASANTH ... 

. |1N,2N,3N,5N... 


Low (Class I) 


Intermediate (Class I) 


High (Class IT) 


¥| Verbose explanation 


Structure diagram Cramer rules 


&@ Q1 Normal constituent of the body No 

aa Q2.Contains functional groups associated with enhanced toxicity No 

aa Q3.Contains elements other than C,H.O.N.divalent S No 

& Q5.Simply branched aliphatic hydrocarbon or a common carbohydrate No 
88 Q6 Benzene derivative with certain substituents No 

#8 Q7 Heterocyclic No 

) Q16.Common terpene No 

&@ Q17 Readily hydrolysed to a common terpene No 

8 Q19 Open chain No 

&@ Q23_Aromatic No 

& Q24.Monocarbocyclic with simple substituents No 

a Q25.Cyclopropane, etc. (see explanation) No 

a Q26.Monocycloalkanone or a bicyclocompound No 

&@ Q22.Common component of food No 

&@ Q33 Has sufficient number of sulphonate or sulphamate groups No Class High (Class IIT) 
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Supplementary figure 5 

Conjugation results of Human Insulin Monomer (PDB ID: 1HLS), with all listed drug delivering molecules individually by Discovery Studio 
software. C1 - Vitamin B12 (cobalamin); C2 - Vitamin H (Biotin); C3 - Folic acid (Vitamin M / Vitamin B9); C4 - Vitamin B1 
(Thiamin); C5 - L-Carnitine (Vitamin BT); C6 - Poly-N-vinylpyrrolidone; C7; Inulin; C8 - Poly Cysteine; C9 — Chitosan; C10 — Pectin; 
C11 - Poly(propylene glycol); C12 - Poly(propylene imine); C13 - Poly (lactic-co-glycolic acid); C14 - Deoxycholic acid 


Supplementary figure 5(a)- C1 - Vitamin B12 (cobalamin) 
No Conjugation 


Supplementary figure 5(b)- C2 - Vitamin H (Biotin) 

Maa = 

Wes 

VY Protein Groups 

8} 1HLS - Monomer_model_1 
&} 1HLS - Monomer_model_2 
8} HLS - Monomer_model_3 
8} 1HLS - Monomer_model_4 
8} 1HLS - Monomer_model_5 
&} 1HLS - Monomer_model_6 
8%} 1HLS - Monomer_model_7 
8} HLS - Monomer_model_8 
8} 1HLS - Monomer_model_9 
%} 1HLS - Monomer_model_10 
8} 1HLS - Monomer_model_11 
8} HLS - Monomer_model_12 
8} 1HLS - Monomer_model_13 
8%} 1HLS - Monomer_model_14 
8} 1HLS - Monomer_model_15 
3 HLS - Monomer_model_16 
8} 1HLS - Monomer_model_17 
8%} 1HLS - Monomer_model_18 


8} 1HLS - Monomer_model_19 
8} 1HLS - Monomer_model_20 [Activ 
V7) $2 Site1 
@ SED_site Sphere 
V|@) SBD_Receptor 
¥] ¢*. Compoundl 
YZ)” <HBondMonitor> 
¥| e« Compoundl:H19 - B:GLN4:0 
¥| e-« Compoundi:H20 - A:CYS11:SG 
¥) e@ Compound1:H31 - B:HISLO:0 


‘ i > 


Name ID Visible Visibility Locked Color Parent Type PDB Name 1-Letter 1-Letter & ID# 1-Letter & ID# & Insertion Code 3-Letter 3-Lett * 
SF z 7 aris = * ree aoe 3 a7 ar a SOF 
10 1E10 10 Vives WV] Yes A Isoleucine TE I 110 110 Tle Tle10 
11 cysi1 11 J) Yes Vives B A Cysteine cys c cu ci cys Cysi1 
12 SER12 12 T)Yes Vives |_| A Serine SER s $12 $12 Ser Ser12 
13 LeU13 3B Wives Vi Yes |_| A Leucine Leu L L13 113 Leu Leu13 
14 TYR14 14 No Wi Yes |_| A Tyrosine TYR Y yi4 yi4 Tyr Tyri4 - 


‘ 7 » 


\wolecule /\_Atom_/\_AminoAcd /\_Bond /\_Group /\_Conformation /\_BindinasitePoint /\_Bindinasite /\_ProteinSequence /\_AminoAcdChain /\_SphereObiect /\_HBondMonitor /\_HBond /\_Label J 


Supplementary figure 5(c)- C3 - Folic acid (Vitamin M / Vitamin B9) 
No Conjugation 
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Supplementary figure 5(d)- C4 - Vitamin B1 (Thiamin) 
ce i oe, °° 
&% oie Groups \ 


By 1HLS - Monomer_model_1 


Bt 
By 1HLS - Monomer_model_2 
By. 1HLS - Monomer_model_3 
B}_ 1HLS - Monomer_model_4 
By 1HLS - Monomer_model_5 
By 1HLS - Monomer_model_6 
By 1HLS - Monomer_model_7 
By. 1HLS - Monomer_model_8 
By 1HLS - Monomer_model_9 
By 1HLS - Monomer_model_10 
By 1HLS - Monomer_model_11 
By 1HLS - Monomer_model_12 
By 1HLS - Monomer_model_13 
B} 1HLS - Monomer_model_14 = 
B} 1HLS - Monomer_model_15 
By 1HLS - Monomer_model_16 
By 1HLS - Monomer_model_17 
By 1HLS - Monomer_model_18 
By 1HLS - Monomer_model_19 
By 1HLS - Monomer_model_20 
Re? Site1 


@ 8D Site Sphere 
SBD_Receptor 

Compound 

<HBondMonitor> 

p- A:CYS11:H - Compoundl:017 
pe B:LEU6:H - Compound1:017 
>» Compoundl:H35 - A:CYS6:0 


« i , 
Name i) Visible Visibility Locked Color Parent Type PDB Name 1-Letter 1-Letter &ID# — 1-Letter & ID# & Insertion Code 3-Letter 3-Lett * 

5 GINS 5 No /| Yes A Glutamine GIN Q Qs Qs Gin Gins 

6 cys6 6 Yes Yes A Cysteine cys c ce c6 Cys Cys6 

7 +cyYs7 7 No Yes A Cysteine cys c c7 c7 cys Cys7 

8 THRS 8 No Yes A Threonine THR. T Ts Ts Thr Thr8 

9 SERS 9 No Yes A Serin SER s eS) 39 Ser Sero 

4M nein 40 Van Minn a ne 7 Tan Tan Te tain = 

« , 


tePoint /\_BindinaSite /\_ProteinSequence_/\_AminoAcidChain /\_SphereObiect /\_HBondMonitor /\_HBond /\_Label_ J 


\Wolecule /\_Atom /\_AminoAcid /\_gond /\_Group /\_Conformation /\_Bindinasi 


Supplementary figure 5(e)- C5 - L-Carnitine (Vitamin BT) 


@ SBD_Receptor 
gs Compoundl 
ea 
Ce) 
° 03 
me 
26 
2 06 
oa 
2 08 
fac) 
NiO 
e cil 
@ H12 
@ H13 
° Hid 
° HIS 
@ H16 
@ Hi7 
° H18 
° HI9 
@ H20 
° Ha 4 
° H22 
2 H23 
@ H24 
o H25 
o H26 
<HBondMonitor> 
@@ A:LEU13:H - Compound1:03 
2 Compoundi:H20 - B:VAL2:0 


‘ it , 
Name 1D Visible Visibility Locked Color Parent Type PDB Name 1-Letter 1-Letter &ID# —1-Letter & ID# & Insertion Code 3-Letter 3-Lett: * 

aa Cio ua ies ves w ” Cysiene or = rest erst cys cysis 

12 SER12 12 Yes Yes |_| A Serine SER s $12 $12 Ser Seri2 

13 LEU13 Yes Yes a A Leucine LEU L L13 L13 Leu Leu13 

14 TYR14 No Yes | | A Tyrosine TYR Y y14 y14 Tyr Tyri4 

15 GLNI5 No Yes A Glutamine GIN Q qus Qs Gin Ginis 

16 LEUI6 16 Yes Yes |_| A Leucine LEU L L16 L16 Leu leu6 


TT 
\ Molecule /\_Atom /\_AminoAcid /\_Bond /\_Group /\_Conformation /\_BindinasitePoint /\_Bindinasite /\_ProteinSequence /\_AminoAcidChain /\_SphereObiect /\_HBondMonitor_ /\_HBond /\_Label J 


Supplementary figure 5(f)- C6 - Poly-N-vinylpyrrolidone 
No Conjugation 
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Supplementary figure 5(g)- C7; Inulin 


%} HLS - Monomer_model_7 a 


8} 1HLS - Monomer_model_8 

8} 1HLS - Monomer_model_9 

%} HLS - Monomer_model_10 
8} 1HLS - Monomer_model_11 
8} 1HLS - Monomer_model_12 
8} 1HLS - Monomer_model_13 
8} 1HLS - Monomer_model_14 
8} 1HLS - Monomer_model_15 
8} 1HLS - Monomer_model_16 
8} 1HLS - Monomer_model_17 
8} 1HLS - Monomer_model_18 
8} 1HLS - Monomer_model_19 
8} 1HLS - Monomer_model_20 
> site1 
@ SBD_site Sphere 

@) SBD_Receptor 

gs ZINC04095703 

y*s ZINC04095703 

ZINC04095703 : 

g*> ZINC04095703 

g*s ZINC04095703 

y*> ZINC04095703 

y’s ZINC04095703 

y’s ZINC04095703 

gs ZINC04095703 

y*s ZINC04095703 

<HBondMonitor> 

e« ZINC04095703 

@» ZINC04095703:H39 - B:GLN4:0 

ee ZINC04095703:H42 - A:CYS11:0 = 


Name iD Visible Visibility Locked Color Parent Type PDB Name 1-Letter 1-Letter & ID 1-Letter & ID# & Insertion Code 
21 ASN21 21 No Yes A Asparagine ASN N N21 
22 PHE1 1 Yes Yes B Phenylalanine PHE F Fi 
23 VAL2 2 Yes Yes |_| B Valine VAL v v2 
24 ASN3 3 Yes Yes B Asparagine ASN N N3 
25 GLN4 4 Yes Yes B Glutamine GLN Q Q4 
26 HISS 5 No Yes B Histidine HIS H HS 


‘ MH 
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Supplementary figure 5(h)- C8 - Poly Cysteine 


*, 1HLS - Monomer 
BRE ProteinSequence 
maa 
mts B 
Y Protein Groups 
%} 1HLS - Monomer_model_1 
§} HLS - Monomer_model_2 
8} 1HLS - Monomer_model_3 
8} 1HLS - Monomer_model_4 
8} 1HLS - Monomer_model_5 
8} 1HLS - Monomer_model_6 
8} 1HLS - Monomer_model_7 
8} 1HLS - Monomer_model_8 
8} 1HLS - Monomer_model_9 
8% 1HLS - Monomer_model_10 
8} 1HLS - Monomer_model_11 
8} 1HLS - Monomer_model_12 
8} 1HLS - Monomer_model_13 
8% 1HLS - Monomer_model_14 
8} 1HLS - Monomer_model_15 
8} 1HLS - Monomer_model_16 
8} 1HLS - Monomer_model_17 
&% 1HLS - Monomer_model_18 
} 1HLS - Monomer_model_19 
%} 1HLS - Monomer_model_20 
8 Sitel 
@ SED_Site Sphere 
§) SBD_Receptor 
*. Compoundl 
b* <HBondMonitor> 
J] e-* Compoundl:Hi1 - B:GLN4:0 
2 Compound1:H12 - B:GLN4:0 


« ii , 
Name ID Visible Visibility Locked Color Parent Type PDB Name 1-Letter 1-Letter &ID# — 1-Letter & ID# & Insertion Code 

23 vaLZ 2 ies ves = B vane VAL v ve vz 

24 ASN3 3 Yes Yes B Asparagine ASN N N3 N3 

25 GLN4 4 Yes Yes B Glutamine GLN Q Q4 Q4 

26 HISS 5 No Yes 8 Histidine HIS H HS HS 

27 _LEU6 6 /\Yes Yes _ B Leucine LeU L Le Lé 


‘ m7 


\Wolecule /\_Atom /\_AminoAcid /\_Bond /\_Group /\_Conformation /\_BindinaSitePoint /\_BindingSite 


'\ ProteinSequence _/\_AminoAcidChain /\_SphereObiect /\_HBondMonitor_ /\_HBond /\_Label J 
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3-Letter 


Asn 
Phe 
Val 
Asn 
Gin 
His 


3-Letter 


3-Lett: * 


Asn21 


Phe 
Val2 

Asn3 
Gin 


HisS 
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Supplementary figure 5(i)- C9 — Chitosan 


ProteinSequence - 
lA 

B 

Protein Groups 

HLS - Monomer_model_1 

LHLS - Monomer_model_2 

LHLS - Monomer_model 
LHLS - Monomer_model_4 
HLS - Monomer_model_5 
HLS - Monomer_model_6 
ALHLS - Monomer_model_7 


LHLS - Monomer_model 8 
HLS - Monomer_model_9 
HLS - Monomer_model_10 
HLS - Monomer_model_11 
LHLS - Monomer_model_12 
[HLS - Monomer_model_13 
HLS - Monomer_model_14 
LHLS - Monomer_model_15 
LHLS - Monomer_model_16 
LHLS - Monomer_model_17 
LHLS - Monomer_model_18 
LHLS - Monomer_model_19 
LHLS - Monomer_model_20 
Site1 

|SBD_Site Sphere 

Receptor 

hpound1 

BondMonitor> 
Compoundl:H18 - A:CYS11:0 
Compoundl:H24 - B:VAL2:0 
‘Compoundl:H25 - B:ASN3:0D1 


“ ii > 
Name ID Visible Visibility Locked Color Parent Type PDB Name 1-Letter I-Letter &ID# —_1-Letter & ID# & Insertion Code 3-Letter 3-Lett * 

22 PHE1 1 Yes Yes B Phenylalanine PHE F Fi Phe Phel 

23 VAL2 2 Yes Yes i | B Valine VAL v v2 Val val2 

24 ASN3 3 Yes Yes 8 Asparagine ASN N N3 Asn Asn3 

25 GLN4 4 Yes Yes B Glutamine GIN Q Qs Gin Ging 

26 HISS 5 No Yes 8 Histidine HIS H HS His HisS + 

« it , 


\ Molecule /\_Atom /\_AminoAcd /\_Bond /\_Group /\_Conformation /\_BindinasitePoint /\_Bindinasite_/\_ProteinSequence_/\_AminoAcidChain /\_SphereObiect /\_HBondMonitor_ /\_HBond J\_Label J 


Supplementary figure 5(j)- C10 — Pectin 


G, 1HLS - Monomer 

88E ProteinSequence 
Mea 
wt .s 

Y% Protein Groups 

83 1HLS - Monomer_model_1 
8} HLS - Monomer_model_2 
8%} 1HLS - Monomer_model_3 
8} 1HLS - Monomer_model_4 
8} HLS - Monomer_model_5 
1HLS - Monomer_model_6 


1HLS - Monomer_model_7 
8} 1HLS - Monomer_model_8 


7t 
3 1HLS - Monomer_model_9 
3 1HLS - Monomer_model_10 


%} 1HLS - Monomer_model_11 
8} 1HLS - Monomer_model_1 


1HLS - Monomer_model_13 
1HLS - Monomer_model_14 
1HLS - Monomer_model_15 
8% 1HLS - Monomer_model_16 
8} HLS - Monomer_model_17 
1HLS - Monomer_model_18 
8} 1HLS - Monomer_model_19 
§} HLS - Monomer_model_20 
@ Site1 

@ S8D_site Sphere 
SBD_Receptor 

Compound 
<HBondMonitor> 

2 Compoundl:H21 - A:CYS11:0 


Name i) Visible Visibility Locked Color Parent Type PDB Name 1-Letter L-Letter &ID# —1-Letter & ID# & Insertion Code 3-Letter 3-Lett 
ae : i = — = ne “=e s = = a ee 
10 E10 10 Yes Yes A Isoleucine Ie I 110 110 le Tle10 
11 cysi1 J] Yes Yes {| A Cysteine cys iC cit cit Cys Cysi1 
12 SER12 Yes Yes | | A Serine SER s $12 $12 Ser Ser12 
13 LeU13 Yes Yes Hi A Leucine LEU 3 L13 LB Leu leu13 = 
« i , 
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Supplementary figure 5(k)- C11 - Poly(propylene glycol) 


1HLS - Monomer 
BRE ProteinSequence 


fe B 
¥Y, Protein Groups 
8} HLS - Monomer_model_1 
8} 1HLS - Monomer_model_2 
&} 1HLS - Monomer_model_3 
%} 1HLS - Monomer_model_ 4 
&} 1HLS - Monomer_model_5 
%} 1HLS - Monomer_model_6 
%} 1HLS - Monomer_model_7 
%} 1HLS - Monomer_model 8 
%} 1HLS - Monomer_model_9 
8% 1HLS - Monomer_model_10 
8} 1HLS - Monomer_model_11 
8} HLS - Monomer_model_12 
%} 1HLS - Monomer_model_13 
%} 1HLS - Monomer_model_14 
%} 1HLS - Monomer_model_15 
&} 1HLS - Monomer_model_16 
8} 1HLS - Monomer_model_17 
8} 1HLS - Monomer_model_18 
8} 1HLS - Monomer_model_19 
8} 1HLS - Monomer_model_20 
@ Site1 
@ SED_Site Sphere 
1 SBD_Receptor 
Compoundl 
<HBondMonitor> 
&% Compoundl:H16 - B:VAL2:0 


‘ T , 


Name 1D Visible Visibility Locked Color Parent Type PDB Name 1-Letter 1-Letter &ID# —1-Letter & ID# & Insertion Code 3-Letter 3-Letti * 
22 PHEL 1 Yes Yes 8 Phenylalanine PHE F Fl Fi Phe Phe1 
23 VAL2 2 Yes Yes B B Valine VAL Vv v2 v2 Val Val2 
24 ASN3 3 Yes Yes 8 Asparagine ASN N NB NB Asn Asn3 
25 GUN4 4 Yes Yes 8 Glutamine GIN Q a4 Q4 Gh Gin¢ 
26 HISS 5 No Yes 8 Histidine HIS H HS HS His Hiss 
27 _LEU6 6 Yes Yes i_| B Leucine LeU L L6 le Leu Leus * 
4 it , 


\Wolecule /\_Atom /\_AminoAcd /\_gond /\_Group /\_Conformation /\_sindinasitePoint /\_gindinasite /\_ProteinSequence /\_AminoAcdChain /\_SphereObiect /\_HBondMonitor /\_HBond /\ Label 


Supplementary figure 5(I)- C12 - Poly(propylene imine) 
No Conjugation 


Supplementary figure 5(m)- C13 - Poly (lactic-co-glycolic acid) 


! 1HLS - Monomer_model_13 “ 
} 1HLS - Monomer_model_14 
1HLS - Monomer_model_15 
+ 1HLS - Monomer_model_16 
1HLS - Monomer_model_17 
} 1HLS - Monomer_model_18 
1HLS - Monomer_model_19 
} 1HLS - Monomer_model_20 
F Site 
) SBD_Site Sphere 
$D_Receptor 
pmpoundl 
a 
02 
a 
a 


o5 


H16 

H17 

H18 
HBondMonitor> 
» Compoundl:H15 - A:CYS11:0 
» Compound1:H18 - B:PHE1:0 


Name iD Visible Visibility Locked Color Parent Type PDB Name 1-Letter 1-Letter & ID# 1-Letter & ID# & Insertion Code 3-Letter 3-Lett: * 
21 ASN21 21 No Yes A Asparagine ASN N N21 N21 Asn Asn21 
22 PHE1 1 Yes Yes B Phenylalanine PHE F Fi F1 Phe Phet 
23 VAL2 2 /) Yes Yes B Valine VAL v v2 v2 Val val2 
24 ASN3 3 Yes Yes 8 Asparagine ASN N N3 N3 Asn Asn3 + 
‘ i , 
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Supplementary figure 5(n)- C14 - Deoxycholic acid 
No Conjugation 
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Supplementary figure 6 

Conjugation results of Human insulin hexamer (PDB ID: 1AIO), with all listed drug delivering molecules individually by Discovery Studio 
software. C1 - Vitamin B12 (cobalamin); C2 - Vitamin H (Biotin); C3 - Folic acid (Vitamin M / Vitamin B9); C4 - Vitamin B1 
(Thiamin); C5 - L-Carnitine (Vitamin BT); C6 - Poly-N-vinylpyrrolidone; C7; Inulin; C8 - Poly Cysteine; C9 — Chitosan; C10 — Pectin; 
C11 - Poly(propylene glycol); C12 - Poly(propylene imine); C13 - Poly (lactic-co-glycolic acid); C14 - Deoxycholic acid 


Supplementary figure 6(a)- C1 - Vitamin B12 (cobalamin) 
No Conjugation 


Supplementary figure 6(b)- C2 - Vitamin H (Biotin) 
8} 1AID-Hexamer_model5 + a 
} LAID - Hexamer_model_6 
8} LAID - Heamer_model_7 
&} 1Al0 - Hexamer_model 8 
8} 1Al0 - Hexamer_model_9 
8} LAID - Hexamer_model_10 


# Site13 
@ S8D_site Sphere 
V7] @) SBD_Receptor 
| @ Structure Monitorl 
| gs Compoundl 
WZ] ¢*s Compoundl 
4 [@) & <HBondMonitor> 
VJ] ee J:GLN4:HE21 - Compounc 
V7] e-« J:GLN4:HE21 - Compounc 
V7] @-@ L:ARG22:H - Compoundl: 
W]e L:ARG22:H - Compoundl: 
J] Compoundl:H31 - L:GLU: | J 


Name ID Visible Visibility Locked Color Parent Type PDB Name 1-Letter 1-Letter & ID+ 1-Letter & ID+ & Insertion Code 3-Letter 3-Let *| 
2 mez 2 No Vives A Isoleucine IE I 12 12 Tle Te2 
3 VALS 3 No Vives B A Valine VAL v v3 v3 Val val3 
4 Gua 4 No Vives | A GlutamicAcd Guu E E4 E4 Glu Glu4 
5 GUNS 5 No Vives B A Glutamine GUN Q Q5 Q5 Gin Ging 
6 cysé6 6 No Vi Yes B A Cysteine cys c ce ce cys Cys6 
7 cys7 7 No V] Yes |_| A Cysteine cys iG c7 c7 Cys Cys? 
8 THRE 8 No Vives i_| A Threonine THR T 138 138 Thr Thra ~ 
« i 
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Supplementary figure 6(d)- C4 - Vitamin B1 (Thiamin) 
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Name ID Visible Visibility Locked Color Parent Type PDB Name 1-Letter 1-Letter &ID# —_1-Letter & ID# & Insertion Code 3-Letter 3-Let * 
297 GLU21 21 No Yes | | L GlutamicAcid GLU E E21 E Glu Glu21 
298 ARG22 22 Yes Yes L Arginine ARG R R22 R22 Arg Arg22 
299 GLY23 23 Yes Yes L Glycine Guy G 623 623 Gly Gly23 
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302 TYR26 26 Yes Yes |_| L Tyrosine TYR y Y26 Y26 Tyr Tyr26 
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Supplementary figure 6(f)- C6 - Poly-N-vinylpyrrolidone 
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Supplementary figure 6(h)- C8 - Poly Cysteine 
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Supplementary figure 6(j)- C10 — Pectin 
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Supplementary figure 6(k)- C11 - Poly(propylene glycol) 
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Supplementary figure 6(I)- C12 - Poly(propylene imine) 
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Supplementary figure 6(n)- C14 - Deoxycholic acid 
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Supplementary figure 7 

Conjugation results of Insulin Lispro (PDB ID: 1 LPH), with all listed drug delivering molecules individually by Discovery Studio software. C1 - 
Vitamin B12 (cobalamin); C2 - Vitamin H (Biotin); C3 - Folic acid (Vitamin M / Vitamin B9); C4 - Vitamin B1 (Thiamin); C5 - L- 
Carnitine (Vitamin BT); C6 - Poly-N-vinylpyrrolidone; C7; Inulin; C8 - Poly Cysteine; C9 — Chitosan; C10 — Pectin; C11 - 
Poly(propylene glycol); C12 - Poly(propylene imine); C13 - Poly (lactic-co-glycolic acid); C14 - Deoxycholic acid 
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Supplementary figure 7(d)- C4 - Vitamin B1 (Thiamin) 
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Supplementary figure 7(e)- C5 - L-Carnitine (Vitamin BT) 


4 @ & <Cell> 
4 (¥)%, 1LPH - Insulin Lispro - Monom 
7] BRE ProteinSequence 
Baa 
Re B 
fe Cc 
& D 
tes 
Re C 
fe D 
& Water 
&) Hetatm 
&Y Protein Groups 
@ Sitel 
@ Site2 
@ Site3 
& Site 4 
@ SED_site Sphere 
Y| @ Structure Monitorl 
¥] @) SBD_Receptor 
V7] Compound 
_| <HBondMonitor> 
¥) e* C:GLU4:N - Compound1:03 
VJ] &* D:GLN4:NE2 - Compound1:03 
© Water:HOH43:0 - Compound] 


Li} 


SSS S EE oe 


OP oP oF o® 


‘ , 
Name D Visible Visibility Locked Color Parent Type PDB Name 1-Letter 1-Letter & ID# 1-Letter & ID & Insertion Code 3-Letter 3-Let * 
54 VAL3 3 Vi) Yes V7 Yes c Valine VAL v v3 v3 Val Val3 
55 GLu4 4 vi Yes Vv) Yes i | c GlutamicAcid GLU iz E4 E4 Glu Glut 
56 GLNS 5 Ji Yes Vi Yes c Glutamine GLN Q Qs Qs Gin Gins 
57 CYS6 6 Vv] Yes J] Yes 2 Cysteine cys c cé C6 Cys Cys6 
5R__cys7 2 Tl Vec Tivee a c Custeine cvs c 7 7 Cus Cve7 ~ 
> 


aa 


< 
\ Molecule /\_Atom /\_Aminoacid /\ Residue /\ Bond /\_Group /\ BindingsitePoint /\_Bindingsite /\_ProteinSequence /\_valence /\_Cell /\_AminoAcidchain /\_Chain /\_SphereObiect /\_StructureMonitor /\_Label_/\_HBondMonitor /\_HBond J 


Supplementary figure 7(f)- C6 - Poly-N-vinylpyrrolidone 
No Conjugation 


paveZ 43 


© The Author(s) 2015. Open Access. This article is licensed under a Creative Commons Attribution License 4.0 (CC BY 4.0) 


Supplementary figure 7(g)- C7; Inulin 
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Supplementary figure 7(i)- C9 — Chitosan 
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Supplementary figure 7(j)- C10 — Pectin 
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Supplementary figure 7(k)- C11 - Poly(propylene glycol) 


4 @® <Cell> 
4 [@&%. 1LPH- Insulin Lispro - Monome 
7) BRE ProteinSequence 


Bea 
we 


2 Water 
& Hetatm 
¥| ¥% Protein Groups 
Qe Site1 
QE Site2 
&* Site3 
@ Sited 

@ SBD_site Sphere 
SBD_Receptor 
Structure Monitort 


*, CompoundL 


4 [@).# <HBondMonitor> 
YJ] C:GLY1:N - Compoundl:08 


Name 1D Visible Visibility Locked Color Parent Type PDB Name 1-Letter I-Letter &ID# —1-Letter & ID# & Insertion Code 3-Letter 3-Let * 
52 GLY1 1 vi Yes ¥) Yes c Glycine GLY G G1 G1 Gly Gly1 
53 1LE2 2 ZiYes ZiYes c Isoleucine Ie I 12 12 lle le2 
54 VAL3 3 WiYes Ziyes c Valine VAL v v3 v3 Val Vag. ~ 


4 Mi 


\ Molecule /\_Atom /\_AminoAcid /\ Residue /\_Bond /\_ Group _/\_BindinasitePoint /\_Bindinasite_/\_ProteinSequence /\_Cell_/\_AminoAcidChain /\_chain /\_SphereObiect /\_StructureMonitor_/\_HBondMonitor_/\_HBond /\_Label_/ 


Supplementary figure 7(I)- C12 - Poly(propylene imine) 
No Conjugation 


Supplementary figure 7(m)- C13 - Poly (lactic-co-glycolic acid) 
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Supplementary figure 8 
Supplementary figure 8(a) 


Interaction results of Oral insulin conjugates (Insulin Monomer (1HLS)- DDM Conjugates) with Insulin Receptor (IR). It does not show 
any interaction in leucine-rich repeat domain (L1, residues 1-157) and in C-terminus of the a-chain (aCT, residues 704-715). 
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Supplementary figure 8(b) 

Interaction results of Oral insulin conjugates (Insulin Hexamer (1AIO)- DDM Conjugates)- DDM Conjugates) with Insulin Receptor (IR). It 
shows the interaction in ARG86, ASN34 of leucine-rich repeat domain (L1, residues 1-157) and no interaction in C-terminus of the a-chain 
(aCT, residues 704-715). 
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Supplementary figure 8(c) 
Interaction results of Oral insulin conjugates (Insulin Lispro (1LPH) - DDM Conjugates) with Insulin Receptor (IR). It shows the 


interaction in ARG86, ASN90 and ARG114 of leucine-rich repeat domain (L1, residues 1-157) and no interaction in C-terminus of the a- 
chain (aCT, residues 704-715). 


fs, 3.1LPH (Insulin Lispro) congugate - 20TG ( 
E85 ProteinSequence 


> 


monn 


<Chain> 

@) Hetatm 

% Protein Groups 
a Interaction HBonds 

2 E:ARG86:HH11 - :UD 

@ E:ARG86:HHLI - :U 
@ E:ARG86:HH12 - :UNKI 
@ E:ARG86:HH12 - :UN! 
2 E:ARG86:HH21 - :UNK‘ 
@ E:ASN90:HD21 - :UNKO:TYR14 
@ E:ASN90:HD21 - :UNK0:0838 
@ E:ARG114:HH21 - :UNKO:GLU17 
E:ARG114:HH22 - :UNKO:GLUI7 |= 
E:GLU287:H - :UNK0:0862 
@ E:SER290:HG - :UNK0:0863 
@ E:CYS308:H - :UNKO:GLU4 
© E:CYS308:H - :UNKO:GLU4 
2 E:AR' HE - :UNKO:THR27 
@ E:CYS524:H - :UNK0:0822 
UNKO:THRS - E:CYS304:0 
@ :UNKO:ASN18 - E:GLU355:0E2 
@ :UNKO:ASN21 - E:GLU353:OE1 
@ :UNK0:0809 - E:LEU305:0 
UN 1809 - E:GLY306:0 
» :UNK0:0809 - E:PRO307:0 
UN ~ E:CYS524:0 
2 :UNI 2 - E:CYS524:SG 
2 :UN 7 - E:ASP522:0 
© :UNK0:0846 - E:GLU141:0E1 
@ :UNK0:0846 - E:GLU141:0E2 
@ :UNK0:0862 - E:GLU287:0 
@ :UNK0:0863 - E:SER290:0G 


(a a ae ae ee 


f 


ae a8 


hg 


CLOTI LCT EER? 


& Interaction Contacts 
® Pi-Pi Interactions - 


4 Tm , 


3, 1LPH (Insulin Lispro) congugate - 2DTG ( « 


88 ProteinSequence 


Oon@OP 


E 


¥] @® <Chain> 

7] @) Hetatm 

¥] % Protein Groups 

@ .** Interaction HBonds 

W]e E:ARG36:HHL1 - :UNKO:ARG22 

ee E:ARG86:HHL1 - :UNKO:ARG22 
@-2 E:ARG86:HH12 - : UNKO:ARG22 
e-e E:ARG86:HH12 - :UNKO:ARG22 
ea E:ARG86:HH21 - :UNKO:ARG22 
@-» E:ASN9O:HD21 - :UNKO:TYRI4 
@-2 E:ASN90:HD21 - :UNK0:0838 
e-e E:ARGI14:HH21 - :UNKO:GLU17 

W]e E:ARGL14:HH22 - :UNKO:GLU17 
oo 
oo 
oo 
oo 
oo 
oo 
oo 


E:GLU287:H - :UNK0:0862 
E:SER290:HG - : UNK0:0863 
E:CYS308:H - ;UNKO:GLU4 
E:CYS308:H - ;UNKO:GLU4 
E:ARG331:HE - :UNKO:THR27 
E:CYS524:H - :UNK0:0822 
sUNKO:THRS - E:CYS304:0 

¥| e« :UNKO:ASN18 - E:GLU355:0E2 
¥| e- :UNKO:ASN21 - E:GLU353:OE1L 
oo ~ E:LEU305:0 
ee :UNKO0:0809 - E:GLY306:0 
2 :UNK0:0809 - E:PRO307:0 
@-@ :UNK0:0822 - E:CYS524:0 
oo 

oo 

oo 


~ E:CYS524:SG 
:UNKO:0827 - E:ASP522:0 
sUNK0:0846 - E:GLU141:0E1 
@-@ :UNK0:0846 - E:GLU141:0E2 
@ :UNK0:0862 - E:GLU287:0 
2 :UNKO0:0863 - E:SER290:0G 
4 Interaction Contacts 
@® Pi-Pi Interactions - 


© The Author(s) 2015. Open Access. This article is licensed under a Creative Commons Attribution License 4.0 (CC BY 4.0) 


paveZ 49 


